WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




per 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 5 : 

C12N 15/00, C12P 19/34, 21/06 
C07H 21/00 



Al 



(11) International Publication Number: WO 91/16427 

(43) International Publication Date: 31 October 1991 (31.10.91) 



(21) International Application Number: PCT/US9 1/029 10 

(22) International Filing Date : 24 April 1 99 1 (24.04.9 1) 



(30) Priority data: 

513,957 



24 April 1990 (24.04.90) 



US 



(71) Applicant: STRATAGENE [US/US]; 11099 North Torrey 

Pines Road, La Jolla, CA 92037 (US). 

(72) Inventors: SHORT, Jay, M. ; 320 Delage Drive, Encinitas, 

CA 92024 (US). SORGE, Joseph, A. ; 17021 Circa Del 
Sur, Rancho Santa Fe, CA 92067 (US). 

(74) Agents: BIGGS, Suzanne, L. et al.; 611 West Sixth Street, 
34th Floor, Los Angeles, CA 90017 (US). 



(81) Designated States: AT (European patent), AU, BE (Euro- 
pean patent), CA, CH (European patent), DE (Euro- 
pean patent), DK (European patent), ES (European pa- 
tent), FI, FR (European patent), GB (European patent), 
GR (European patent), IT (European patent), JP, KR, 
LU (European patent), NL (European patent), NO, SE 
(European patent). 



Published 

With international search report. 



(54) Title: METHODS FOR PHENOTYPE CREATION FROM MULTIPLE GENE POPULATIONS 



(57) Abstract 



Methods of producing biological agents which express a desired identifiable phenotype are provided. These methods in- 
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quences, each comprising one member of each population, expressing the combined nucleotide sequences to give a phenotype 
and identifying those biological agents expressing the desired phenotype. 



i 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Austria 


ES 


Spain 


MG 


Madagascar 


AU 


Australia 


Fl 


Finland 


ML 


Mali 


BB 


Barbados 


FR 


France 


MN 


Mongolia 


BE 


Belgium 


GA 


Gabon 


MR 


Mauritania 


BF 


Burkina Faso 


GB 


United Kingdom 


MW 


Malawi 


BG 


Bulgaria 


GN 


Guinea 


NL 


Netherlands 


BJ 


Benin 


GR 


Greece 


NO 


Norway 


BR 


Brazil 


HU 


Hungary 


PL 


Poland 


CA 


Canada 


IT 


Italy 


RO 


Romania 


CF 


Central African Republic 


JP 


Japan 


SD 


Sudan 


CG 


Congo 


KP 


Democratic People's Republic 


SE 


Sweden 


CH 


Switzerland 




of Korea 


SN 


Senegal 


CI 


Cole d*I voire 


KR 


Republic of Korea 


SU 


Soviet Union 


CM 


Cameroon 


LI 


Liechtenstein 


TD 


Chad 


CS 


Czechoslovakia 


LK 


Sri Lanka 


TG 


Togo 


DE 


Germany 


LU 


Luxembourg 


US 


United States of America 


DK 


Denmark 


MC 


Monaco 







WO 91/16427 



PCI7US91/02910 



1 

DESCRIPTION 

Methods for Phenotvpe Creation 
From Multiple Gene Populations 

Cross Reference to Related Application 

This is a continuation-in-part application of copend- 
ing application Serial No. 513, 957 , filed April 24 , 1990 
which is a continuation-in-part of Serial No. 353,235, 
5 filed May 16, 1989, and Serial No. 353,241, filed May 17, 
1989, the disclosures of which are hereby incorporated by 
reference. 

Field of the Invention 

The present invention relates to methods for randomly 
10 combining populations of nucleotide sequences and select- 
ing those combinations coding for a desired predetermined 
phenotype . 

Background of the Invention 

The production of genetic variants, including vari- 
15 ants of both polypeptides and organisms such as bacteria 
and phage, has been a goal in the work of many individuals 
involved in recombinant DNA technologies. For example, 
researchers have beneficially relied upon random genetic 
recombination in the past for the production of new and 

2 0 useful microorganisms. Genetic recombination includes a 

variety of processes that produce new linkage relation- 
ships of genes or parts of genes. Genetic recombination 
is often subdivided into general genetic recombination, 
which takes place between homologous chromosomes, more or 
25 less anywhere along their length, and recombination that 
does not require extensive homology. The latter category 
includes site-specific recombination, which depends upon 
the existence of specific sites in one or more molecules 
and which includes interactions of viral genomes and 

3 0 insertion sequences with chromosomes of prokaryotes and 
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eukaryotes, and less well defined instances of recombina- 
tion that appear to require neither extensive homology nor 
special sites. Variable gene expression can also result 
in production of various combinations of polypeptides, the 
5 immune system being one example of such protein 
combination . 

The immune system of a mammal is one of the most 
versatile biological systems; probably greater than 1.0 x 
10 7 antibody specificities can be produced. Indeed, a 

10 great deal of contemporary biological and medical research 
is directed toward tapping this repertoire. During the 
last decade, furthermore, there has been a dramatic 
increase in the ability to harness the output of the 
immune system. The development of the hybridoma method- 

15 ology by Kohler and Milstein has made it possible to 
produce monoclonal antibodies, i.e. , a composition of 
antibody molecules of single epitope specificity, from the 
repertoire of antibodies induced during an immune 
response. Monoclonal antibodies have been generated in 

2 0 the past from hybridomas, generated by fusing antibody- 

secreting lymphocytes with an immortal cell line, such as 
myeloma . 

Although standard hybridoma technology has been 
extremely valuable, the screening of fused cells to iden- 
25 tify hybridomas expressing useful antibody molecules is 
labor intensive, time consuming and expensive. Moreover, 
the standard technology yields rodent antibody molecules 
that have two clear disadvantages. The first is that 
subtle variations in certain human antigenic systems, such 

3 0 as major histocompatibility proteins, are not easily dis- 

tinguished by non-primate antibodies. Therefore, rodent 
antibodies may not provide the repertoire of specificities 
needed to distinguish certain polymorphic antigenic deter- 
minants. In other words, current methods for generating 
35 monoclonal antibodies are not capable of efficiently 
surveying the entire antibody response induced by a 
particular immunogen. Thus, in an individual animal there 
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ub 3ABq Abui saxpoqxq.UB qons q6noqq.iv -ua6xq.uB paq.oaias b 
jloj oijioads Apoqxq.UB q.uapo«i b uioaj suot63J aiqBXJtBAaadAq 
qq.xw (shod) suoxfiaj aiqBXJtBAaadAq aqq. paoBidaa ge 
pUB Apoqxq.UB UBUinq b ua^^q. aABq saaqo.XBasaa amos 'suxBuiop 
aqq. jo suox6aj[ ^aowaureaj q.uapo.1 aqq. q.suxB6B paq.oao:xp 
asuodsaj oxdAq.oxpx-xq.ue aqq. ptoab oq. q.a:ojja ui 

• Apoqxq.UB 

UBumq aand b q.suxB6B paq.oa.xxp asuodsajt uxbuiop eiqeiJ^A b oe 
uBqq. ajtaAas aaoui sx q.Bqq. asuodsaa b q.xoxia nxq.s suxbuiop 
^aoMauiBJCj uox6a-i aiqBxaBA paAxaap-q.uapoa aqq. 'JtaAaMOH 
•snao g 6uxonpo.id-Apoqxq.UT2 0xdAq.0sx-xq.u12 jo uoxq.Binuixq.s 
aqq. 6uxpxoAB 'asuodsaa aunuiuix uxinqoib-xq.UB UBumq aqq. 
saqsxuxuixp suoxfiao: q.uBq.suoo UBumq jo asn aqx -Apoqxq.UB gz 
UBumq b uioaj paAxaap suxbuiop uoxbaa q.uBq.suoo iBUXuiaaq. 
Axoq.XBO aqq. pue Apoqxq.UB ibuoioououi quapoj oxjxoads 
b uioaj paAxaap suxBqo AABaq pub q.q6xi aqq. 30 suxbuiop 
aiqep^A iBUXULxaq. ouxuib aABq sainoaioui asaqi, *suxaq.o.id 
Apoqxq.uB uoxsnj ssaadxa oq. A6oiouqoaq. YNCI q.UBUxquiooaa asn oz 
oq. uaaq sBq snao BiuopxaqAq UBumq jcoj paau aqq. sq.uaAUinoaxo 
q.Bqq. qoBoaddB auo -AJoq.OBjsxq.BS UBqq. ssai snao 
6uxonpoad-Apoqxq.UB azxiBq.j:ouimx oq. pasn spoqq.aui p.XBpuBq.s 
•saxpoqxq.UB ibuoioououi UBUinq amd aonpoad puB Ajxq.uapx 
oq. q.inoxjjxp aq.xnb sx q.x ' Aiaq.Bun^:i:ojun -Apoqxq.UB ssa;tdxa gi 
q.Bqq. snao uBiunq aonpojtd oq. sx aAxq.BUJcaq.iB auo 

•sasodand oxq.nadB.iaqq. jo oxq.sou6Bxp 
xoj sq.uaxq.Bd ux asn panuxq.uoo axaqq. apnioaad ubo 
Pub 'suBiunq ux oxua&ounuiutx Aiq6xq aaB saxpoqxq.UB q.uapoa 
q.Bqq. sx ABoiouqoaq. BuiopxaqAq ux ^oeqwBjp joCbui puooas OX 
V -uoxsnj aad saxpoqxq.UB ibuoioououi q.uaaajjxp paapunq 
*aj b Axuo piaxA AiiBOXdA-q sax6oiopoqq.aui BUiopxaqAq q.uaaano 
'saxpoqxq.uB q.uaaajjxp xoj iBxq.ua^od q.sBA sxqq. oq. q.sBaq.uoo 
ui •paq.BJtauafi aq Abui sainoaioui Apoqxq.UB anbxun jo aaquinu 
paq.xuixiun ub AnBxq.uassa 'Aq.xsJcaAxp Apoqxq.UB jo uoxq.Baaua6 g 
aqq. Buxanp uoxq.Bq.nui oxq.Binos 3:0 ssaooad aqq. 3.0 asnBoaq 
'aaqq-anj • louaqdoa^xuxp aidmBxa jcoj 'sb qons ' suabounuiuix 
pxBxJi: AiaAxq.Biaa hbuis b oq. saxpoqxq.uB anbxun 6uxq.Bjaua6 
jo aiqBdBO sauoio nao-g q.uaaajjxp OOO'OT-S q.SBai q.B a^B 

Z 
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•sxsAxeq.eo oq. pasoddo se 'uoT^BZiiB^neu jo sseoojd aqq. ux 
uoxq.edxoTq.aHd aoj paubxsap Axq.uaaaqux saxpoqxq.ue Aq.xux:pje gc 
q6xq uoiqonpojd aqq. spae/*oq. ' anbxuqoaq. euiopxaqAq aqq. 

qons ' sanbxuqoaq. Buxuaaaos axqexxeAe Axq.uaaano jo sexq 
aqq. aq oq. paAaxxaq sx UOSB9J aaqq.ouv -Aq.XAxq.oe paaxsap 
aqq aoj 9jxoq.a8dej: aqq. jo uoxq.aod aBaex e uaajcos oq. 
Aq.xxxqeux aqq. sx 'AjeAoosip iBiiq.oe axaqq. aaq.je uaAa aqep oe 
oq. Aueui aq.exosx oq. aanxxej sq.x pue 'aaxx^ea ajxoqaadaj 
XeoxBoxounuiuix aqq. uioaj saxpoqxq.ue oxqAxeq.eo aq.exosx 
oq. paxxej qjB aqq. uoseaa auo q.eqq. paAaxxaq sx q.i 

# (696X) ' A*N 'TTXH-AT2JEDOW 
'ADoxouiAzua pue Aaq.sxuiaqo ux sxsAxeq.eo ux s^ouar *d SZ 

Aq ofie saeaA OZ q.souixe paq.exnq.sod uaaq AxsnoxAaad peq 
'sauiAzua a^xx Buxq.oe '-a*x 'uoxq.oeaa x^oxuiaqo BuxzAxeq.eo 
e jo axqedeo saxnoaxoui Apoqxque 'aonpoad oq. aaxoqaadaa 
aqq. aonpux oq. Aq-XXT^ *° eouasaad aqj, • (686X) 

Ofrfr-Z.£fr:frfrZ 4 -xos epuer .' (986X) eZ.SX-OZ.SX : VZZ OZ 

,T PS '-T« V> ^oexxod (986X) 0Z.SX-99SX : ' - xos '*T« 

q.a oueqxrouieai, -Aq.XAxq.oe oxq.Axeq.eo BuxAeq suxxnqoxfiouriuiuix 
jloj Buxpoo sauaB uxeq.uoo oq. punoj uaaq Axq.uaoaa 
seq saq.eaqaq.aaA jo aaxoq.aadaa oxfioxounuiuix aq^ 

•C20S2X0 -on uoxq.eoxxqn<i q.uaq.ed gx 
ueadoana '*x« q-© AxXT^O P UB *X89'frXZ.'fr "on q.uaq.ed *S*Q 
'Buxpean 2 00fr6£Z0 # otf uoxq.eoxxqrid q.uaq.ed ueadoana 
q.a aaq.uxM i 6600/68 OM *on uoxq.eoxxqrid JOd /- X^ uosuxqon 
'axduiexa aoj 'aas -uiaq.sAs uoxssaadxa oxq.oAaeooad e oq.ux 
q.saaaq.ux jo euiopxaqAq aexnoxqaed e uioaj saua6 Buxonpoad ox 
-uxxnqoxbounuiuix aqq. auoxo oq. uaaq seq suraxqoad o^q. 
aaq.q.ex asaqq. amooaaAO oq. paq.dmaq.q.e seq q.ae aqq. qoxqw. Aq 
sueaui auo •saanq.xno euiopxaqAq jo Aq.xoedeo uoxq.onpoad aox 
pue Aq.xxxqeq.sux oxq.auaB apnxoux A6oxopoqq.aui euiopxaqAq aqq. 
qq.XA saxpoqxq.ue x^uoxoououi Buxonpoad ux s^oeq^eap aaqq.o g 
' Aq.xoxua6ounuiuix pue Aq.xoxjxoads aaxoq.aadaa uioaj apxsv 

•BuxBuaxx^^ AxT^OTuqoaq. sx auaB uxxnqoxBounuiuix ueuinq 
e oq.ux SHOD q.uapoa XT^ 6uxq.jea6 jo ssaooad aqq. 'Apoqxq.ue 
q.uapoa q.uaaed aqq. oq. axqeaeduioo ua5xq.ue aoj Aq.xuxjje 
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jo suoxq.exndod aqj, *paq.eaedas aae aouanbas apxq.oaxonu 
q.uaaed 11012a uiojj paAxaap seoxxdaa asaaAxp jo suoxq.exndod gz 
buxq.xnsaa aqq. 'uoxq.eoxxdaa aaq.je 'axnoaxoui VNO euo asxad 
-uioo saouanbas apxq.oaxonu q.uaaed aqq. aJtatfM -saxnoaxoui VNO 
aq.eaedas asxaduioo saouanbas apxq.oaxonu q.ua:ied aqq. AxaAxq. 
-euaaq.x^ jo axnoaxoui YNd ajBuxs e asxaduioo Aeui saouanbas 
apxq.oaxonu q.uaaed aqq. 'q.oadse auo ui -seoxxdajr asaaAxp 0£ 
jo uoxq.exndod e ux sq.xnsa.1 pue aouanbas apxq.oaxonu q.uaaed 
auo uioaj Aq.xsaaAxp saq.e;iaua6 qoxq/* jrnooo oq. suoxq.eq.nui 
moix^ qoxq/\ suoxq.xpuoo Jtapun aouanbas apxqoaxonu q.uaaed 
e 6uxq.eoxxdaa Aq paq.eaaua6 sx seoxxdaa asaaAxp jo uoxq.ex 
-ndod e 'AiaAxq.Buaaq.lv -seoxxdaa asaaAxp jo uoxq.exndod e gz 
aAxb oq. paq.eoxxdaa sx qoxq*\ saouanbas apxq.oaxonu asaaAxp 
jo (Ajxurej jo) uoxq.exndod e sasxaduioo Anexq.xux aouanbas 
apxq.oaxonu q.uaaed qoea 4 q.uauixpoquia auo ui -saouanbas apxq. 
-oajonu q.uaaed jo seoTidsj asaaAxp jo uoxq.exndod e spxaxA 
daq.s 6uxq.eoxxdaa aq,i • saouanbas apxq.oaionu q.uaaed owq oz 
jo suoxq.aod q.seax q.e 6uxq.eoxxdaa q.saxj asxaduioo spoqq.aui 
asaqq. sq.uauixpoquia auo ui • saouanbas apxqoaxonu q.uaaed aqq. 
Aq passaadxa Axx^^ou l ou S T H^T 1 !* adAq.ouaqd e Axx^oxdAq. 
sx adAq.ouaqd paaxsap am, -saouanbas apxq.oaxonu pauxquioo 
jtaqq.o BuxAeq sq.ua6e x^oxboxoxq uioaj uiaqq. qsxnbuxq.sxp gx 
pue aouanbas apxq.oaxonu pauxquioo aexnoxqaed aqq. buxAeq 
sq.ua6e x^^T 6o TOxq aqq. Ajxq.uapx oq. pasn aq ueo adAq.ouaqd 
pxes uxaaaqM pue aouanbas apxq.oaxonu pauxquioo jPinoi^JHd 
e jo uoxssaadxa uioaj sq.xnsaa adAq.ouaqd sxqq. uxaaaq^ 
adAq.ouaqd isaou paaxsap e buxAeq sqxtabe x^oxboxoxq buxonp ox 
-ojd joj spoqq.aui oq. paq.oaaxp sx uoxq.uaAux q.uasaad aqj, 

uoxq.uaAUi aqq. jo Aaeuiuins 

• sapxq.dadAxod 

pue saouanbas apxq.oaxonu qons jo buxuaaaos pue uoxq.onpoad 
aqq. jcoj saqoeo;rdde aAxq.euaaq.XB pansand aAeq aA 'saxpoqxq.ue g 
Xeuoxoououi ueuinq aand jo uoiqonpojd pue uoxq.eoxjxq.uapx 
aqq. buxpnxoux ' sapxq.dadAxod paq.eaauab Axuiopuea asx/uaqq.o 
jo uoxq.euxquioo paaxsap aqq. jo saouanbas YNQ paaxsap jo 
uoxq.euxquiooaa paubxsap aqq. aouequa oq. q.duiaq.q.e ue ui 
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BuxABq SII30 aq.B3JO oq. 'ajojajenq. 'pazxxxq.n aq ubo spoqq.aui 
asaqj, • suoxq.Bxndod q.uaj£Bd aqq. 3:0 qoBa uiojij saouanbas apxq. 
-oaionu feuTUt^uoo sauoxo jo A:iB.xqxx b ulxoj oq. pauxquiooa.2 ge 
aq ubo qoxq/* saouanbas apxq.oaxonu jo suoxq.Bxndod 6uxq.;iBq.s 
o/*q. q.sBax q.B azxxT^ n spoqqaui aqq. 'qoadsB auo ui -sauoxo 
jo uoi^oaxioo asjcaAxp b 9^ieu96 oq. uiaq.sAs uoxq.oaxas 
A.XBaqxx x^T^ 0 ^-^ 11 ! 0 ? 1100 13 3-° esn et H- apnxoux Axx^auab 
spoqq.aui asaqj, *sadAq.ouaqd x^aou buxABq sxx a ^ jo buxuoxo OZ 
aqq. J.03 spoqq.aui isaou sapxAoad uoxquaAUX q.uasaad aqi 

•saxnoaxour aaqq.o qq.x/\ uoiq.OBJ3q.ui 
aoj q.3baBq. b qoe Axduixs Abui YNH VNa aq^ iAq.TAxq.oB 
aoqdaoaj jo Aq.XAxq.oe oxq.BuiAzua q.xqxqxa Abui VNH -*o sapxq. 
-dadAxod aqq. aaqq.xa ' Axx^ uo T^O •saxnoaxoui VNH a;ioui gz 
auo jo sxsaqq.uAs asxjduioo Abui adAq.ouaqd aqq. 'AxaAxq.Buaaq.xv 
•axqBq.oaq.ap sx qoxqw jo uoxssaJtdxa aqq. ' apxq.dadAxod 
ox.xauixq.xnui b ho 1 apxq.dadAxod auo UBqq. a.zoui 'apxq.dadAxod 
b asxaduioo Abui adAq.ouaqd paaxsap aqj, -saouanbas apxq. 
-oaxonu pauxquioo o:aqq.o 6uxssa,zdxa sq.uabB x^oxboxoxq uiojcj oz 
aouanbas apxq.oaxonu pauxquioo aBxnoxq.;iBd aqq. buxssaadxa 
sq.uabB xb^T 60 ! 0 ! 0 ! asoqq. qsxnbuxq.sxp oq. pasn sx adAq. 
-ouaqd aqq. 'paaxsap ji •paxjxq^uapx aaB adAq.ouaqd paaxsap 
aqq. ssajdxa qoxq/* sq.uabB x^oxboxoxq aqq. 'uaqi, *q.ua6B 
XBOxboxoxq auiBs aqq. ux saouanbas apxq.oaxonua qq. jo aoua gx 
-q.sxxa aaaui aqq. aq.nq.xq.suoo Abui uoxssa^dxa >q.uauixpoquia auo 
ui -uxa^aq paqxaosap aaB uoxssaadxa aoj suBaui 3iqeqins 
qons puB 'ioqo3A pxoB oxaxonu jo 'siuia ao abBqdox.xaq.OBq 
'pxuiSBXd ^ ' Ax3Axq.Buaaq.x^ ' TT&o qsoq b asxadmoo Abui 

sq.uabB x^oxboxoxq qons *sq.uabB x^oxboxoxq ux passajcdxa ox 
a^B saouanbas apxq.oaxonu pauxquioo aqj, "saxaBaqxx 
XBX^oq.BUxquioo jo uoxq.BJtaua6 pub saouanbas apxq.oaxonu 
jo uoxq.BUxquiooa.1 puB uoxq.BOTXda;r aoj uxaaaq paqx^osap 
spoqq.aui suotiba aqq. oq. buxpjtoooB aaqq.aboq. q.qbnoaq 
puB paq.BOXxdaa Axq^T ns ec l i ^ 13UI saouanbas apxq.oaxonu g 
q.uajBd aqj, •SBOxxda.x asaaAxp jo uoxq.Bxndod qoBa jo aaquiaui 
auo sasxaduioo aouanbas apxq.oaxonu pauxquioo qoBa uxa^aq/^ 
saouanbas apxq.oaxonu pauxquioo aonpoad oq. 'aauuBui uiopuBa b 
ux AxqB^cajaad 'jaqq.a6oq. q.q6noaq uaqq. aaB sBOXXdaa asaaAxp 

9 
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•suosodsusaq. jo asn aqq puB 'id absqd 
-oxaaqoBq jo uiaqsAs xoT-aao aqq 'uoxqBUxquiooaa pazATBqBO 
-asBuxquiooaa df! 'suxaqoad paq.BT9-i-9sea6asq.uT buxpnioux ge 
spoqqaui Aq uoxqBUxquiooaa ox jxoads-aqxs puB 'uoxqBUxquiooaa 
snoboTouioq 'uoxqBOXT. Pub uoxqsaBxp uoxqoxaqsaj: jo asn aqq 
apntoux saouanfias YNd paaxsap ouxuxqmoo jo spoqqaw 'TT^o 
qsoq aqq ux soxqsxaaqoBaBqo oxdAqouaqd jo suoxqBUxquioo 
taAou oaxa ux 9^B3ao oq snao qsoq paaxsap oq paaaaxsuBaq 0£ 
eq uaqq ubo saoqoaA asaqj, •oxqsxaaqoBaBqo aiqB 

-xixquapx aqq aqsaao oq Atxuibi auab qoBa 10 aaquiaui auo jo 
quauiqaosss uiopuBa b uxBquoo Abui Aqaaaqq paonpoad saoqoaA 
aqq pus saxixuiBj auab OAvq usqq aaoui aq Abui aaaqq ATaAxq 
-BuaaqiV •suoxqBTndod quaJBd aqq 10 qoBa uioax saouanbas gs 
VNa buxuxsquoo sauoxo JO AaBaqxx e uiaox oq 'uxaaaq 
paqxaosap asoqq buxpnToux 'sanbxuqoaq iBaaAas jo Aub Aq 
pauxquioo AnBOXsAqd aaB saoqoaA buxuxBquoo-aouanbas VNO jo 
suoxqBTndod fiuxqaBqs o^q qsBaj qs ' quaurcpoquia auo ui 

•sboiouioq aqq jo AqxAxqoB iBOxfioioxq aqq OZ 
suxBquxBui qoxqw 'a^xT. aqq pub auxTBS paaajjnq aqBqdsoqd 
'aaqBM sb qons 'uoxqBxndxuBui oaqxA ux xoj aiqBqxns uinxpaui 
b ux quasaad aaB sboiouioq aqq 'Atqeaajaaa • quBuxuiaaqap 
oxuabxquB paAJtasuoo b aasqs qoxqA jo AqxTBanTd b 'sboiouioq 
VNO buxpoo-^ ao/puB- H A S 0T qsBai qB AiqBaajaad aaoui pub gi 
OX qnoqB qssax q* AiqBJtajaad ' £ 0X qnoqB qssai qB jo aanq 
-xxuipB paqBiosx ub £>uxsxaduioo AjtBaqxi auab b saqBiduiaquoo 
uoxquaAUX quasaad aqq 'quauixpoquia aaqqouB ui 

•anbxuqoaq BUiopxaqAq aqq jo saxoBnbapBUX 
pauoxquaui-aaoiaq aqq buxuiooaaAO Aqaaaqq 'aiqxssod ox 
uaaq aaojoqaxaq sBq usqq AqxAxqoB paqoaiasaad b buxABq 
sioqdaoaj aoj aaxoqaadaa iBOXboTounuiuix aqq ?o uoxqaod 
aaBaBT h uaaaos oq pasn aq ubo spoqqaui asaqq 'uoxquaAUX 
aqq 50 qoaCqo aaqqBi sq^- q^-T* aouspaoooB ui -saino 
-aioui uxinqoTbounuiuix quaqadmoo AiiBoxboioxq 30 uoxqBUiaogt g 
aqq ux pazxixqn sapxqdadAiod aqq buxpmoux ' sapxqdadAiod 
jo suoxqBuxquioo wau 3.0 uoxqonpoad aqq aoj pasn aq osib 
ubo spoqqaui asaqi -sapxqdadAiod passaadxa 30 uoxqBUxquioo 
paaxsap pus asu b buxABq stt^o 'sx qBqq 'sadAqouaqd xaAOU 
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aoj sixao q.soq aiq^q/cns *s:roq.oaA afiisqd ' AxaAxq.mi:iaq.x^ 'jo 
sio^oaA pxutsoo jo ppus^xd apnxoux uot^usaut q.uasajcd aqq. 
jo spoqq.aui aqq. oq. Buipjoooe asn .10 j sjo^osa axqeq.Tns ge 

•saxxxuiBj 9ua6 px^s uio:rj sjtaquiaui 
aua6 jo suoxq.i2Uxquioo pajrxsapun Butut^uoo uoxq.^xndod 
aqq. uxqq.XM saoq.oaA aaqq.o uioaj jojoba aqq. qsxnBuxq.sxp 
oq. oxq.sx:iaq.OB:reqo pies Buxsn pu^ 'saxxxui^g: auaB Jtaqqo 
px^s jo qo^a uioaj auaB auo pu^ Axxurej auaB q.s:ixj px^s oe 
uiojrj auaB auo jo uoxsnxoux aqq. uioaj Buxq.xnsaj; oxq.sxaaq. 
-OBJBqo pajcxsap e Buxonpojd Axq^3«oaq.ap jo axq^deo aoqoaA 
"B sjo^osa jo uoxq^x^idod PT^ S uxqq.XM 6uxAjxq.uapx (q 

isaxxjurej auaB aaqq.0 pxiss jo qoea uioaj 
saua6 aqq. jo auo pue Axp^iJ ^U96 q.s.zxj pies uiojcj sauaB gz 
aqq jo auo utb^uoo Abui :toq.oaA qoca uxajcaq** paq.i2a:io sjb 
saoqoaA jo uoxq.Bxndod e q.^qq. os sauaB jo saxxT^J J®xo.o 
aaoui Jio auo uioaj aaquiaui auo puB sauaB jo Axxurej q.saxj 
b jo jaquxaui auo saoq.oaA oq.ux 6uxq.aasux Axuiopue:t (h 

:sasx.xduioo oz 

poqq.aui qoxqtt 'sauaB jo saxxT 13112 ? jo sjaquiaui jraqq.o 

Buxpooua sjo^oeA jaqq.o uioaj sauaB px^s Buxpooua jojoba aqq. 
Ajxq.uapx oq. pasn aq ueo qeqq. oxq.sxaaq.oi2:n2qo e Buxonpoad 
jo axq^deo Buxaq sauaB pies 'sauafi jo AxT^J B uioaj qoea 
sauaB pajxsap aaoui 10 o*\q. Buxpooua aoqoeA PT°^ oxaxonu b gx 
Buxonpoad jo poqq.aui is Buxpnxoux 'sadAq.ouaqd jo uoxq.eaauaB 
X^xjtoq.^uxquioo aqq. joj spoqq.aui sapnxoux osx^ uoxq.uaAUX 
q.uasaad aqq. 'snqj, • sasesodsuwq Aq pazxuBooaa saouanbas 
q.nadaa x^uxuuaq. P UB 'sa6eqdox.2aq.oeq pxopqurex uio«zj suxaq.oad 
paq.BX3^c-9SHJcBaq.ux Aq pazxufiooao: saouanbas q.e 'saouanbas oi 
uoxq.euxquiooa:i dxox x^uoT^- 011 * 1 ? 'saouanbas uoxq.euxq 
-uiooa;i dxj x^uoxq.ounj J° asn ei W apnxoux saoqoeA asaqq. jo 
uoxq.euxquiooaj[ aqq. Buxxq^ua saouanbas •uoxq.exndod Axparej 
auaB Buxq.«xeq.s qoea uiojej auaB auo q.seax vs jo aaqq.aBoq. 
uoxq.euxqniooa.x uiopuea 'oxjxoads-uoxBaa aqq. aq.eq.xxT 013 J £ 
qoxqM saouanbas VNa 'suoxq.^xndod AxT^J Q^aB Buxq.j:Bq.s 
aqq. uiojcj saouanbas YNCI uiopuca oq. uoxq.xppe ux 'asxaduioo 
qoxq/^i spoqq.aui aqq. ux asn jloj saoq.oaA saq.^x<3uiaquoo 
uoxq.ua aux quasejd aqq. ' q.uamxpoqnia aaqq.^nj XIT^ S e U I 
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oj^ia ut Aq (suoTq.T2indod 10) saxipirej aouanbas apxq.oaionu 
aaoui jco o^q. jo suot^uiqwoo jo uoi^ejo aqq. saq.eT*uaq.uoo 
uopueAUT q.uasaad aqq. '^uauixpoquia jraq^ang: TTT^ S e U I 

•aaq.ouioJtd uwo axaqq. uioaj passaadxa 
q oea ejp saxixureg: aua6 aqq. uioaj saua6 aqq. ' AiaAxq.T2u:iaq.TY 
•saxixurej auab aqq. uioaj sauaB aqq. sassaadxa q.T2qq. ,xaq.ouioad 
ai6uxs e ut^uoo uot^uoaut q.uasa:id aqq oq. fiuxpaoooe 
paonpoad sao^oaA aqq. '^oedsB paajtajaad I^^^ouB ui oe 

•aq.xs buxpuxq 

ua£>xq.ue suucoj sapxq.dadAiod qq.oq jo uoxq.oeaaq.ux aqq. q.eqq. 
qons saua£> Apoqxq.ui2 uicxj paAxaap a;re sapxq.dadAiod asoqq. 
AiT^uoxq^do -ainoaTOui j:aqq.oue jloj aq.xs Buxpuxq v uiaoj oq. 
^oMaqui An^oxsAqd q.eqq. sapx^dadAiod ao ainoaxoui aunmuix gz 

Aq pazxufiooaa ado^xda-oau b uiaoj oq. pw3;ux An© 
-oxsAqd q.sqq. sapxq.dadAiod ssaadxa Ahui saua6 qq.og *oxq.sx 
-ae^OBJ^o aiqexjxq.uapx ue eqeaio oq. q.soq 12 xiq.XM qonaaqux 
An^oxsAqd sapxq.dadAtod ao/pue svNH PT^ S P u * apx^dadAiod 
jo/pue VNH ssaadxa sauafi qq.oq 'poq^aur uoxq.T20xjx:quapx oz 

paaaajaad i=> ui -adoq.xda aoj aq.xs Buxpuxq ainoaxoui 

aunuiuix ue jo 'Apoqxqxre ub Aq pazxufiooaa adoq.xda 
'axnoaioui :taqq.ou*? aoj aq.xs Buxpuxq 12 uiequoo Aeui ainoaxoui 
uiaqoad aiqi=>xjxq.uapx ut? qons 'AiT^uox^do -ainoaxoui 
uxaq.oad axq^X3xq.uapx ub sauab qq>oq 10 auo jo uoxssaadxa gi 
aqq. (p) puB .' ainoaioui YNH st3 saxixureg: aua6 aqq. uioaj 

sauab qq.oq jco auo jo uoxssaadxa aqq. (o) .'saxixurej auab 
aqq. uioaj saua6 qq.x/\ saqoad pxoe oxaxonu 3:0 uoxq.HZxpxaqAq 
aqq. (q) :pauxquioo aJta/\ qoxq/* saxixurej xenpxAxpux aqq. 
uioaj saua6 qq.x/a spxoe oxaxonu oxjxoads-aouanbas 3:0 uoxq.oe ox 
-Jia^ux aqq. (e) apnioux spoqq.aui asaqj, -uoxq^indod aoq.oaA 
qAreuxqutooaa aqq. uxqq.xM aoq.oaA j^inoxq^d e jo uoxq.eoxg: 
-xq.uapx aqq. jtoj pasn aq Aeui spoqq.aui aiqi2q.xns suotj^a 
» oaxa ux pauxquioo aae sauafi aqq. ' Aiqeaa^aaa 

•xiod S 

•3 asxadmoo snao q.soq pue aficqdoxaaq-oeq epqui^i asxaduioo 
saoq.oaA aq^ 'q.oadsc paaaajaad auo ui -sTiao uexT^uruiBiu 
apnioux snao oiqoAj^na paaaajaad -stT©s> ox^oAaexoad 
jo snao oxq.oA:iT23[na aaqq.xa asxaduioo sxoqoaA aqq. £>uxssaadxa 
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sBoiidaj esjeAip jo suoxq.exndod Axaureu 'AqxsaaAxp buxq.ea.io 
jo spoqq.aui oq. paq.oa;ixp st uoxq.uaAUX quasa.id aqq 'sum, 
•Axxurej aouanbas apxq.oaxonu aaqq.o aqq. Aq papooua sx Jtauixp gz 
aqq. jo q.xunqns ,xaqq.o aqq pue Axxuiej aouanbas apxqoaxonu 
auo Aq papooua sx jauixp aqq. jo ^xunqns auo a;raq*\ suxaqoad 
oxa:auixpojtaq.aq jo uoxq.ea.io aqq. asxaduioo pxnoo sadAqouaqd 
qons 'axdurexa Jtoj; -uoxq.exndod qoea uioaj aouanbas apxq. 
-oexonu e buxsxjtduioo suoxqeuxquioo jexnoxqaed jo uoxssaadxa oe 
qqxtt Axuo q.uasa:id aq pxno*\ pue 'passaadxa aia^ saouanbas 
apxqoaxonu jo uoxqexndod suo Axuo uiojej saouanbas aqq. jx 
quasaad aq q.ou pxno/\ qeqq. sadAqouaqd Aeu joj pauaaaos aq 
uaqq ueo saouanbas passajdxa jo uoxqextidod x^T- Z0 ^ u T c [ ut00 B 
qons •saxxxurej aouanbas qqoq uiojj aouanbas apxqoaxonu aqq gz 
jo uoxssaadxa aoj jo^osa uoxssaadxa axbuxs e oqux paqjtasux 
Axquanbasqns pup 'paonpoad aq ueo saouanbas apxqoaxonu 
uoxsnj jo uoxqextidod x^T« xo: l» BU T c n u00 ^ 'snqj, -uoxqeuxquiooaji 
qons ux aq.edxoxq.asd ueo saouanbas apxqoaxonu aqq jo 
XT^ - 10 qsoui uaqq ' suoxq^xridod aouanbas apxqoaxonu o/*q aqq qz 
jo XT^ -3° qsoui uo quasaad sx Aboxouioq jo uoxbaa paaeqs aqq 
j I • (aaaq TZ aanbxj aas) suoxqexndod aouanbas apxqoaxonu 
qqoq uiojij saouanbas uxequoo XTT A qoxqtt. paqeaao sx qonpojtd 
uoxsnj e uaqq 'spuwqs Aaequauiaxduioo jo sxsaqquAs uxbaq 
oq aseaauiAxod e Aq pasn aq ueo qeqq „xaxduioo aqexdmaq sx 
-jcauixjid,, e saqeaao buxjixed aseq qons ji -saouanbas 
apxqoaxonu jo suoxqexndod qqoq uioaj saxnoaxoui pxoe oxaxonu 
papueaqs axbuxs uee^qaq Jtnooo oq. BuTJiBd-aseq jioj HDd 
aqq buxanp axqxssod sx qx uaqq 'Aboxouioq jo uoxbaa e aaeqs 
saouanbas VNO jo suoxqexndod owq jo qqoq ji • (HDd) uoxq ox 

uxeqo aseaauiAxod aqq se qons spoqqam uoxqeoxjxxdure 
pxoe oxaxonu oaqiA ux buxsn Aq sx oaqxA ux uoxqeuxquiooa,! 
snoboxouioq jo elioj e qsxxdmoooe oq. poqqaiu auo 

• uoxqeuxquiooaa 

snoboxouioq aqeqxxxoej oq. suoxqexndod aouanbas g 
apxqoaxonu qqoq Aq paaeqs saouanbas snoboxouioq Buxsn 
Aq ao ' (aseuxquiooaa: dxj a^lTl) saseuxquiooaa oxjxoads pue 
saouanbas q.abaeq. uoxq.euxqmooaJt oxjxoads buxsn q.no paxaaeo 
aq pxnoo uoxq.euxqmooaj oaq.XA ux qons •uoxq.euxquiooaa 
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H A uoifiea aiqBTJBA uxcqo AABaq uxinqoibounuiuix ue '^sjtj 
•saouanbas YNCI buxssaadxa-ApoqxquB usuinq isaou aonpoad 
oq pazxixqn eae spoqqaui aqq ' quauixpoquia aaqqous ui S£ 

•AiTBiaoqBUiquioo psq.Baaue5 aq Abui absqd isquaasd 
aqq uioaj qouxqsxp sadAqouaqd asjsAip buiABq absqd Abm 
sxqq ui -sadAqouaqd paaxsap 30 uoxssaadxa joj uoxqoaias 
Aq pawoxioj 'uoxqBbxi pus 'suoxqBxndod absqd 10 buxxxui Aq 
paAioxxox 'assaionuopua uoxq.oxaq.saa b Aq sb qons abBABaio oe 
Aq paqsaauab aq Abui sibux6xjco aqq uioag: qouxqsxp sadAqouaqd 
buxABq absqd -aBBqd asaaAip 30 (s) uoxqBindod b aq.Baaua6 
oq 'oxuabBqnui qsqwauios aaB qoxq/4 suoxqxpuoo 'paqsoxidaa 
AxiBOxq-Ai aq Abui absqd qouxqsxp AnBOiqauab aaoui jo auo 
'uoxquaAUX quasaad aqq 30 qoadss aaqqous oq buxpaooov 9Z 

buxpuxq uabxqus paousqua buiABq saxpoqxqus 
apnioux Abui sadAqouaqd xaAOU asaqi, -sadAqouaqd xaAOU 
aqsaauafe oq pasn pub (uxaaaq paqxaosap sb) HOd uoxsrti 
(aiduiBxa aoj) uoiqBoixiiduiB apxqoaionuAiod uoxsnj sb qz 
qons sanbxuqoaq buxsn pauxquioo aq Abui saouanbas paq.Bq.nui 3.0 
suoxqBindod isnpiAipux asaqq ' AxaAxq.Buaaq.TY -sadAqouaqd 
XaAOU aqsaao oq pasn uaqq aaB qoxq* saxaBaqxi x^T^oq-^uT^ 
-uioo 33(bui oq pasn aq Abui saouanbas paqBqnui 30 suoxqBindod 
XBnpxAxpux aqi -saouanbas paqBqnui jo uoxqBindod b aqsaao gx 
oq paqBaauab aaB saouanbas paqBqnui qsqq suoxqxpuoo aapun 
(HOd sb qons poqqaui b buxsn) paxxxiduiB AnsnpiAxpux aq Abui 
uxBqo AABaq aqq pus uxsqo qqbxx. aqq. aoj buxpoo saouanbas 
apxqoaxonu aq,L -auxi IT^o buxonpoad ApoqxquB-iBuoioououi 
uaouji b uioaj paqBiosx uaaq ssq qoxqw uxBqo AABaq 01 
b pub uxsqo qqbxx e qqoq aoj buxpoo saouanbas Apoqxqus 
buxuxBquoo pxuisBid B qqx* qasqs Abui auo 'aiduiBxa aod 

•AaBaqxi TBxaoqBUiquioo 
b ao (sauab jo) saouanbas apxqoax.onu 30 Aixuibj 
b qqxtt buxqasqs Aq ao sxsauabBqnui Aq sb qons 'AqxsaaAxp g 
aqBaao oq paqsaaq sx qoxqw ainoaioui YNCI aibuxs b qqx* 
buxqasqs paqsaauab aq Asm AqxsaaAxp qons -paqoaias aq Abui 
adAqouaqd paaxsap b qoxqw utoaj 'sadAqouaqd 30 AqxsaaAxp 
b aAxb oq pauxquioo aq Abui qoxqw saouanbas apxqoaionu 3.0 
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jo a.xxoq.Jtada;r e qq.oq jx 'asanoo jo -papiAOjd sq.uauiaxduioo 
auaB 6uxpoo- H A q.uaa:ajjxp aqq. jo Aq.xx^nxd e oq. sazxpxaqAq 
q.eqq. jcauixad sxsaqq.uAs apxq.oaxonuAxod e Buxsn uoxq.oeaa ge 
uoxsuaq.xa .xauixjid e oq. paq.oaCqns eis sq.uauiaxcfoioo aqq. 'a^xx 
aqq. pue sxsaqq.uAs i^oxuiatjo 'YNCI oxuiouaB pue.xq.s aiqnop 
jo uoxq.eanq.euap sb qons poqq.aui e Aq paonpoad sq.uauiaxduioo 
auaB 6uxpoo- H A q.uaaajjxp jo Aq.xx^J:t\xd e si x^T Jle ^ eui 
Buxq.:req.s aqq. ajaqtt 'sx q.eq£ *uoxq.oeajc uoxsuaq.xa aauixjcd 0£ 
puooas passnosxp-aAoqe aqq. oq. paq.oaCqns sx eaioqaedaj 
aqq. 'uoxq.oea.x uoxsuaq.xa .xauixjcd q-sjcxj aqq. ueqq. aaqq.o poqq.aui 
e Aq papxAoad sauaB Buxpoo- H A q.ua:rajjxp jo sq.uauiaxduioo 
jo Aq.xx^nxd e sx x^T^ 3 ^ 12111 Buxq.;teq.s aqq. aaaqM 

• sq.uaiuaxciuioo auaB Buxpoo- H A jo Aq.xx^arixd e Buouie gz 
paAJcasuoo aouanbas apxq.oaxotiu e oq. BuxzxpxaqAq Aq uoiqoeej 
puooas aqq. 6uxq.exq.xux oq. axqedeo sx q.eqq. aauixad sxsaqq.uAs 
apxq.oaxonuAxod puooas e sasn uoxq.oeajc uoxsuaq.xa Jtatuxad 
puooas aqj, •sBoxoiuoq YNCI Buxpoo- H A q.uaaajjxp jo Aq.xx^*tixd 
e 'saq-exduiaq. se sq.uauiaxduioo aqq. Buxsn 'seonpojd uoxq.oeaa qz 
uoxsuaq.xa aauixad puooas aqj, • (aa:xoq.aadaj aqq. ux sauaB 
aqq. oq. A;teq.uaraaxduioo spueaq.s pxoe oxaxonu) sq.uamaxdmoo 
Boxouioq 6uxpoo- H A q.ua;iajjxp jo Aq.xxBanxd e saonpoad 
uoxq.oeaa: uoxsuaq.xa jcauixad q-SJixj aqj, -a^xoq.aadaa aqq. 
uxqq.XA (sauaB jo Aq.xx^mxd e Aq paaeqs) paAaasuoo aouanbas gx 
apxq.oaxonu e oq. BuxzxpxaqAq Aq uoxq.oea;r q.saxj aqq. Bux 
-q.exq.xux jo axq^deo .xauixjrd sxsaqqxtAs apxq.oaxonuAxod q.saxj 
e sasn uoxq.oea;r uoxsuaq.xa aauixjd q.saxj aqj, -suoxq.oeaj: 
uoxsuaq.xa aauixad q.ouxq.sxp o/*q oq. paq.oaCqns sx aaxoq.aadaa 
aqq. 'sauaB 6uxpoo- H A jo Aq.xx^nxd e sx x^T^ 19 ^ 111 Buxq.aeq.s oi 
aqq. aaaq& * x^T- 13 ^ 111 Buxq.;ieq.s aqq. uo Buxpuadap ' spoqq.aui 
snoiiBA Aq pazxsaqq.uAs aq ueo Aaeaqxx auaB aqi 

* sBoxouioq 

YNd Buxpoo- H A se uxajtaq oq. paaaajaa spueaq.s pxoe oxaxonu 
6uxpoo- H A q^uaaajjxp s ox q.noqe q.seax q.^ Axq^aajaad ajoui g 
pue ^ox ^-tioqe q.seax q.^ Axq^aaja^d a^oui pue £ ox q^noqe q.seax 
q.e suxeq.uoo Aaeaqxx euaB 6uxpoo- H A axo. 4 sq.uamxpoquia paxiaj 
-ao:d ui *pazxsaqq.uAs sx aq.eaqaq.aaA e jo aaxoq.aadaa auaB 
H A aqq. jo uotqjod x^T^ u ' B ^ sc [ tls * Buxuxeq.uoo Aaeaqxx auaB 
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b 6uxuxB:quoo apxq.dadAiod b ' ( h a) uoxfiaa aiqBXJBA aqq. 
s^U9S3Jd9i uxBqo AABaq aqq. uo eaiB pajoaxo am -saanq-Baj ge 
TB:inq.on:iq.s TBdxouxad aqq. Buxwoqs eitioaiom uxinqoiBounuxuix 
aqq 30 uiBaBBxp oxqBuiaqos b saq.Baq.snnx x aanDX^ 
:amsoTOSTp sxqq 30 uox^jod b Buxulzoj s6ux/\Bap aqq ui 

sbuxttBaa &m jo uoxq.axaosaa jaxag 

•spoqq.aui pauiXBio aqq. jo sq.onpoad OE 
aqq. 3:0 Aub BuxsxJtduioo sq.x^{ puB paonpoad saoq.oaA 'spxoB 
oxaxonu pun sapxq.dadAiod qons spoqq.aui asaqq. jo sq.onpo.xd 
uoxssaadxa aqq. osib 'AqaJtaqq paonpoad sq.uaBB TBOX&ojoxq 
aqq. sh qons 'uoxquaAUX aqq. jo spoqqaui aqq. Aq paonpoad 
sq.onpoad sassBdutooua osib uoxq.uaAUX q.uasaad aqx QZ 

•paq.121du1aq.uoo ostb sx 'uxaaaq paqxaosap 
A £ jo h a ^ Aiqeaajaad 'uoxquaAUX q.uasaad aqq. jo poqq.aui b 
Aq paonpoad Aq.XAxq.oe paq.oaiasaad b BuxABq aoq.daoaa v 

•auaB paq.Biosx aqq. aonpoad 
oq. uoxq.Bindod aqq. uioaj paq.Bfia:iBas uaqq. sx Aq.XAxq.oe paaxsap oz 
aqq. Buxssaadxa q.UBinaojsuBaq. v -pasn aq ubo aoq.daoa^ 
b oq puBfixi b jo Buxpuxq aqq 6uxq.oaq.ap aoj qaB aqq. ux 
uaou^i IT^ spoqq.aui aiqeq/cns al W JO Auv -puBfixi paq.oaias 
-aad aqq. spuxq q.Bqq. aoq.daoaa b jo uoxssaadxa aoj pauaaaos 
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from the constant regions of spleen mRNA using primers 11 
and 13 (Table 1) . 

Figure 5 depicts nucleotide sequences of clones from 
the cDNA library of the PCR amplified V H regions in Lambda 
5 ZAP vector. The N-terminal 110 bases are listed here and 
the underlined nucleotides represent CDR1 (complementary 
determining region) . 

Figures 6A and 6B depict the sequence of the 
synthetic DNA insert inserted into Lambda ZAP vector to 

10 produce Lambda Zap II V H (6A) and Lambda Zap V L (6B) 
expression vectors. The various features required for 
this vector to express the V H and V L -coding DNA homologs 
include the Shine-Dalgarno ribosome binding site, a leader 
sequence to direct the expressed protein to the periplasm 

15 as described by Mouva et al., J. Biol . Chem. , 255:27, 
1980, and various restriction enzyme sites used to opera- 
tively link the V H and V L homologs to the expression 
vector. The V H expression-vector sequence also contains a 
short nucleic acid sequence that codes for amino acids 

2 0 typically found in variable regions heavy chain (V H 

Backbone) . This V H Backbone is just upstream and in the 
proper reading as the V H DNA homologs that are operatively 
linked into the Xho I and Spe I. The V L DNA homologs are 
operatively linked into the V L sequence (6B) at the Nco I 
25 and Spe I restriction enzyme sites and thus the V H Backbone 
region is deleted when the V L DNA homologs are operatively 
linked into the V L vector. 

Figure 7 depicts the major features of the bacterial 
expression vector Lambda Zap II V H (V H -expression vector) 

3 0 are shown. The synthetic DNA sequence from Figure 6 is 

shown at the top along with the T 3 polymerase promoter from 
Lambda Zap II vector. The orientation of the insert in 
Lambda Zap II vector is shown. The V H DNA homologs are 
inserted into the Xho I and Spe I restriction enzyme 
3 5 sites. The V H DNA are inserted into the Xho I and Spe I 
site and the read through transcription produces the 
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decapeptide epitope (tag) that is located just 3 1 of the 
cloning sites. 

Figure 8 depicts the major features of the bacterial 
expression vector Lambda Zap II V L (V L expression vector) 
5 are shown. The synthetic sequence shown in Figure 6B is 
shown at the top along with the T 3 polymerase promoter from 
Lambda Zap II vector. The orientation of the insert in 
Lambda Zap vector II is shown. The V L DNA homologs are 
inserted into the phagemid that is produced by the in vivo 
10 excision protocol described by Short et al., Nucleic Acids 
Res. . 16:7583-7600, 1988. The V L DNA homologs are inserted 
into the Nco I and Spe I cloning sites of the Phagemid. 

Figure 9 depicts a modified bacterial expression 
vector Lambda Zap II V L II. This vector is constructed by 
15 inserting this synthetic DNA sequence, 

TGAATTCTAAACTAGTCGCCAAGGAGACAGTCATAATGAA 

TCGAACTTAAGATTTGATCAGCGGTTCCTCTGTCAGTATTACTT 
ATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCTG 

TATGGATAACGGATGCCGTCGGCGACCTAACAATAATGAGCGAC 

2 0 CCCAACCAGCCATGGCCGAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 

GGGTTGGTCGGTACCGGCTCGAGCAGTCAAGATCTCAATTCGCCGGCAGCT 
into Lambda Zap II vector that has been digested with the 
restriction enzymes Sac I and Xho I. This sequence 
contains the Shine-Dalgarno sequence (ribosome binding 
25 site) , the leader sequence to direct the expressed protein 
to the periplasm and the appropriate nucleic acid sequence 
to allow the V L DNA homologs to be operatively linked into 
the SacI and Xbal restriction enzyme sites provided by 
this vector. 

3 0 Figure 10 depicts the sequence of the synthetic DNA 

segment inserted into Lambda Zap II vector to produce the 
lambda V L II-expression vector. The various features and 
restriction endonuclease recognition sites are shown. 

Figure 11 depicts the vectors for expressing V H and V L 
3 5 separately and in combination. The various essential 
components of these vectors are shown. The light chain 
vector or V L expression vector can be combined with the V H 
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expression vector to produce a combinatorial vector con- 
taining both V H and V L operatively linked for expression 
to the same promoter. 

Figure 12 depicts the labelled proteins immuno- 
5 precipitated from E. coli containing a V H and a V L DNA 
homolog are shown. In lane 1, the background proteins 
immunoprecipitated from E. coli that do not contain a V H or 
V L DNA homolog are shown. Lane 2 contains the V H protein 
immunoprecipitated from E. coli containing only a V H DNA 

10 homolog. In lanes 3 and 4, the commigration of a V H 

protein a V L protein immunoprecipitated from E. coli 

containing both a V H and a V L DNA homolog is shown. In 
lane 5 the presence of V H protein and V L protein expressed 
from the V H and V L DNA homologs is demonstrated by the two 

15 distinguishable protein species. Lane 5 contains the 
background proteins immunoprecipitated by anti-E. coli 
antibodies present in mouse ascites fluid. 

Figure 13 depicts the transition state analogue 
(formula 1) which induces antibodies for hydrolyzing 

2 0 carboxamide substrate (formula 2) . The compound of 

formula 1 containing a glutaryl spacer and a N- 
hydroxysuccinimide-1 inker appendage is the form used to 
couple the hapten (formula 1) to protein carriers KLH and 
BSA, while the compound of formula 3 is the inhibitor. 
25 The phosphonamidate functionality is a mimic of the 
stereoelectronic features of the transition state for 
hydrolysis of the amide bond. 

Figure 14 illustrates the PCR amplification of Fd and 
kappa regions from the spleen mRNA of a mouse immunized 

3 0 with NPN. Amplification was performed as described in 

Example 17 using RNA cDNA hybrids obtained by the reverse 
transcription of the mRNA with primer specific for ampli- 
fication of light chain sequences (Table 2) or heavy chain 
sequences (Table 1) . Lanes F1-F8 represent the product of 
35 heavy chain amplification reactions with one of each of 
the eight 5' primers (primers 2-9, Table 1) and the unique 
3" primer (primer 15, Table 2). Light chain (k) amplifi- 
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cations with the 5' primers (primers 3-6, and 12, respect- 
ively, Table 2) are shown in lanes F9-F13. A band of 700 
bps is seen in all lanes indicating the successful 
amplification of Fd and k regions, 
5 Figure 15 depicts the screening of phage libraries 

for antigen binding is depicted according to Example 17C. 
Duplicate plaque lifts of Fab (filters A, B) , heavy chain 
(filters E,F) and light chain (filters G,H) expression 
libraries were screened against 125 I-labelled BSA conjugated 

10 with NPN at a density of approximately 30,000 plaques per 
plate. Filters C and D illustrate the duplicate secondary 
screening of a cored positive from a primary filter A 
(arrows) as discussed in the text. 

Screening employed standard plaque lift methods. XL.1 

15 Blue cells infected with phage were incubated on 150mm 
plates for 4 hours at 37 °C, protein expression induced by 
overlay with nitrocellulose filters soaked in lOmM isopro- 
pyl thiogalactoside (IPTG) and the plates incubated at 25° 
for 8 hours. Duplicate filters were obtained during a 

20 second incubation employing the same conditions. Filters 
were then blocked in a solution of 1% BSA in PBS for 1 
hour before incubation with rocking at 25° for 1 hour with 
a solution of 125 I-labelled BSA conjugated to NPN (2 x 10 6 
cpm ml" 1 ; BSA concentration at 01 M; approximately 15 NPN 

25 per BSA molecule) in 1% BSA/PBS. Background was reduced 
by pre-centrifugation of stock radiolabelled BSA solution 
at 100,000 g for 15 minutes and pre-incubation of solu- 
tions with plaque lifts from plates containing bacteria 
infected with a phage having no insert. After labeling, 

30 filters were washed repeatedly with PBS/0.05% Tween 20 
before development of autoradiographs overnight. 

Figure 16 depicts the specificity of antigen binding 
as shown by competitive inhibition is illustrated accord- 
ing to Example 17C. Filter lifts from positive plaques 

125 

3 5 were exposed to I-BSA-NPN in the presence of increasing 
concentrations of the inhibitor NPN. 
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In this study a number of phages correlated with NPN 
binding as in Figure 15 were spotted (about 100 particles 
per spot) directly onto a bacterial lawns. The plate was 
then overlaid with an IPTG-soaked filter and incubated for 
5 19 hours at 25°. The filter were then blocked in 1% BSA 
in PBS prior to incubation in 125 I-BSA-NPN as described 
previously in Figure 15 except with the inclusion of vary- 
ing amounts of NPN in the labeling solution. Other 
conditions and procedures were as in Figure 15. The 

10 results for a phage of moderate affinity are shown in 
duplicate in the figure. Similar results were obtained 
for four other phages with some differences in the 
effective inhibitor concentration ranges. 

Figure 17 depicts the characterization of an antigen 

15 binding protein is illustrated according to Example 17D. 
The concentrated partially purified bacterial supernate of 
an NPN-binding clone was separated by gel filtration and 
aliquots from each fraction applied to microtitre plates 
coated with BSA— NPN . Addition of either anti-decapeptide 

2 0 ( ) or anti-kappa chain antibodies conjugated with alka- 

line phosphatase was followed by color development. The 
arrow indicates the position of elution of a known Fab 
fragment. The results show that antigen binding is a 
property of 50 kD protein containing both heavy and light 
25 chains. 

Single plaques of two-NPN-positive clones (Figure 15) 
were picked and the plasmid containing the heavy and light 
chain inserts excised. 500 ml cultures in L-broth were 
inoculated with 3 ml of a saturated culture containing the 

3 0 excised plasmids and incubated for 4 hours at 37 °C. 

Proteins synthesis was induced by the addition of IPTG to 
a final concentration of ImM and the cultures incubated 
for 10 hours at 25 °C. 200 ml of cells supernate were 
concentrated to 2 ml and applied to a TSK-G4 000 column. 
35 50 fil. aliquots from the eluted fractions were assayed by 
ELISA. 
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For ELISA analysis, microtitre plates were coated 
with BSA-NPN at 1 ug/ml, 50 ^1 samples mixed with 50 /il 
PBS-Tween 2 0 (0.05%)-BSA (0.1%) added and the plates 
incubated for 2 hours at 25°. After washing with PBS- 
5 Tween 20-BSA, 50 fil of appropriate concentrations of a 
rabbit anti-decapeptide antibody (20) and a goat anti- 
mouse kappa light chain (Southern Biotech) antibody 
conjugated with alkaline phosphatase were added and 
incubated for 2 hours at 25°. After further washing, 50 
10 (Ml of p-nitrophenyl phosphate (lmg/ml in 0.1M Tris pH 9.5 
containing 50 mM MgCl 2 ) were added and the plates incubated 
for 15-30 minutes before reading the OD at 405nm. 

Figure 18A depicts the major features of the 
bacterial expression vector HCFLP containing a V H DNA 
15 homolog and a flp recombination site. 

Figure 18B depicts the major features of the 
bacterial expression vector LCFLP containing a V L DNA 
homolog and a flp recombination site properly oriented for 
recombination with the HCFLP vector. 

2 0 Figure 19 depicts a diagrammatic sketch of bacterial 

coinfection with HCFLP and LCFLP vectors for the produc- 
tion of recombinant expression vectors containing V L and V H 
DNA homologs. 

Figure 20 depicts an outline showing arm selection 
25 for heavy and light chain recombinant vector products 
using flp recombinase in conjunction with selection based 
on the inclusion of genes having amber mutations. 

Figure 21 shows an outline of a method of phenotype 
creation using the fusion PCR process described herein. 

3 0 Figure 22 illustrates human fusion PCR inside 

primers. The heavy chain C H 1' inside primer sequence is 
written 3 1 to 5 1 and the light chain V L inside primer 
sequence is written 5 1 to 3 1 . Note that it is not the 
primer strands that cross-prime to create the fusion 
35 molecule, but the complementary PCR product strands. 
Boxed nucleotides represent regions where the C H 1 ! primer 
hybridizes to the 3 1 end of C H 1 on human IgG heavy chain 
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mRNA or where the V L primer hybridizes to the 5 f end of V L 
framework-1 on human kappa light chain cDNA. Underlined 
sequences indicate the two stop condons. The italicized 
amino acid and nucleotides indicate changes in sequence 
5 from the original pelB leader sequence* The mouse fusion- 
PCR internal primers overlap in a similar manner. 

Figure 23 illustrates an ethidium bromide stained 
agarose gel. After PCR amplification from human cloned 
DNA of heavy chain alone (HC) , light chain alone (LC) , and 

10 the heavy/light dicistronic DNA molecule (H/L) , DNA sam- 
ples were electrophoresed. The expected sizes of the HC, 
LC, and H/L products visualized on the gel were approxi- 
mately 73 0, 690, and 1,3 90 base pairs, respectively. 

Figures 24A and 24B illustrate the major features of 

15 the bacterial expression vector Lambda ZAP II Modified V H 
(Modified ImmunoZAP H) (V H -expression vector) (IZ H) . The 
amino acids encoded by the synthetic DNA sequence from 
Figure 24A is shown along with the T 3 polymerase promoter 
from Lambda ZAP II. The orientation of the insert in 

2 0 Lambda ZAP II is as presented. The insert was modified by 

the elimination of the Sac I site between the T 3 polymerase 
and Not I site and by the change of amino acids at the 5 ' 
end of the heavy chain from QVKL to QVQL (alysine residue 
was changed to a glutamine residue) . The V H and V L DNA 
25 homologs were inserted into the Xho I and Xba I cloning 
sites of the phagemid as described in Figure 26 and shown 
in Figure 2 4B. The modifications were made to create a 
fusion-PCR library from hybridoma RNA, to overcome 
decreased efficiency of secretion of positively charged 

3 0 amino acids in the amino terminus of the protein. Inouye 

et al., Proc. Natl. Acad. Sci.. USA , 85:7685-7689 (1988), 
and to make the V L Sac I cloning site a unique restriction 
site. 

Figures 25A and 25B illustrate the sequences of the 
3 5 synthetic DNAs inserted into Lambda ZAP to produce Lambda 
Zap II V H (ImmunoZAP H) (25A) and Lambda Zap V L (ImmunoZAP 
L) (25B) expression vectors. The various features 
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required for these vectors to express the V H and V L -coding 
DNA homologs include the Shine-Dalgarno ribosome binding 
site, a leader sequence to direct the expressed protein to 
the periplasm as described by Mouva et al., J. Biol. 
5 Chem. . 255:27, 1980, and various restriction enzyme sites 
used to operatively link the V H and V L homologs to the 
expression vector • The V H expression-vector sequence also 
contains a short nucleic acid sequence that codes for 
amino acids typically found in variable regions of the 

10 heavy chain (V H Backbone) . This V H Backbone is just 
upstream and in the proper reading frame as the V H DNA 
homologs that are operatively linked into the Xho I and 
Spe I restriction sites. The V L DNA homologs are opera- 
tively linked into the V L sequence (25B) at the Sac I and 

15 Xba I restriction enzyme sites. 

Figure 26 illustrates the major features of the 
bacterial expression vector Lambda Zap II V H (ImmunoZAP H) 
(V H - expression vector) . The amino acids encoded by the 
synthetic DNA sequence from Figure 25A is shown at the top 

2 0 along with the T 3 polymerase promoter from Lambda Zap II. 

The orientation of the insert in Lambda Zap II is as pre- 
sented. The V H DNA homologs were inserted into the 
phagemid that is produced by the in vivo excision protocol 
described by Short et al., Nucleic Acids Res. , 16:7583- 
25 7600, 1988. The V H DNA homologs were inserted into the 
Xho I and Spe I restriction enzyme sites. The read 
through transcription produces the decapeptide epitope 
(tag) that is located just 3 1 of the cloning sites. 

Figure 27 illustrates the major features of the 

3 0 bacterial expression vector Lambda Zap II V L (ImmunoZAP L) 

(V L expression vector) . The amino acids encoded by the 
synthetic DNA sequence shown in Figure 25B is shown at the 
top along with the T 3 polymerase promoter from Lambda Zap 
II. The orientation of the insert in Lambda Zap II is as 
3 5 presented. The V L DNA homologs are inserted into the Sac 
I and Xba I cloning sites of the phagemid as described in 
Figure 26. 
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Figure 28 illustrated an autoradiogram showing 
signals obtained from human phage clones. Approximately 
100 lambda phage were spotted onto E^coli lawns, creating 
plaques that were overlaid with nitrocellulose filters 
previously soaked in 10 mM isopropylbeta-D-thiogalacto- 
pyranoside (IFEG) to induce Fab expression. Following 



py x. anus \ — - — ~- / — , 

overnight incubation, the filters were reacted with 
tetanus toxoid probe. After washing, the filters were 
exposed to X-ray film. The column on the right represents 
10 the parental clones that were selected form a combina- 
torial library. Mullinax et al., Proc Natl ■ Acad. Sci., 
USA, 87:8095-8099 (1990). The column on the left repre- 
sents clones that were generated by amplifying, the 
combinatorial lambda clone DNA with the V H and C L ' outside 
15 primers, C H 1' and V L inside primers, followed by recloning 
in the modified ImmunoZAP H vector. Clone 7G1 is a nega- 
tive control which expresses a Fab that does not react 
with tetanus toxoid. Clones 10C1 and 6C12 both produce 
Fabs that react with tetanus toxoid. IZ H is the modified 
20 heavy chain ImmunoZAP H vector without an insert. 

Detailed nescripi-ion of the Invention 
A. npfinitions 

As used herein, the following terms have the 
following menings unless expressly stated to the contrary: 
25 Nucleotide : a monomeric unit of DNA or RNA consist- 

ing of a sugar moiety (pentose), a phosphate, and a nitro- 
genous heterocyclic base. The base is linked to the sugar 
m oiety via the glycosidic carbon (!' carbon of the 
pentose) and that combination of base and sugar is a 
30 nucleoside. When the nucleoside contains a phosphate 
group bonded to the 3- or 5 • position of the pentose it is 
referred to as a nucleotide. 

Base_Pair (bp) : a pairing (by hydrogen bonding) of 
adenine (A) with thymine (T) , or of cytosine (C) with 
35 guanine (G) in a double stranded DNA molecule. In RNA, 
uracil (U) is substituted for thymine. 
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Nucleic Acid ; a polymer of nucleotides, either 
single or double stranded. 

Gene: a nucleic acid whose nucleotide sequence codes 
for an RNA or polypeptide. A gene can be either RNA or 
5 DNA. 

Complementary Bases ; nucleotides that normally pair 
up when DNA or RNA adopts a double stranded configuration. 

. Complementary Nucleotide Sgg uepcg.; a sequence of 
nucleotides in a single-stranded molecule of DNA or RNA 
10 that is sufficiently complementary to that on another 
single strand to specifically hybridize to it with 
consequent hydrogen bonding. 

Conserved ; a nucleotide sequence is conserved with 
respect to a preselected (reference) sequence if it non- 
15 randomly hybridizes to an exact complement of the 
preselected sequence. 

Hybridization ; the pairing of substantially 

complementary nucleotide sequences (strands of nucleic 
acid) to form a duplex or heteroduplex by the establish- 
ment of hydrogen bonds between complementary base pairs. 
It is a specific, i^ non-random, interaction between two 
complementary polynucleotides that can be competitively 
inhibited. 



20 



25 



30 



35 



Nucleotide Analoq ; a purine or pyrimidine nucleotide 
that differs structurally from A, T, G, C, or U, but is 
sufficiently similar to substitute for the normal nucleo- 
tide in a nucleic acid molecule. 

DNA Homoloq ; is a nucleic acid having a preselected 
conserved nucleotide sequence and a sequence coding for a 
receptor capable of binding a preselected ligand. 

Receptor: A receptor is a molecule, such as a 
protein, glycoprotein and the like, that can specifically 
(non-randomly) bind to another molecule. 

Antibody ; The term antibody in its various grammati- 
cal forms is used herein to refer to immunoglobulin 
molecules and immunologically active portions of immuno- 
globulin molecules, i.e., molecules that contain an 
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antibody combining site or paratope. Exemplary antibody 
molecules are intact immunoglobulin molecules, substan- 
tially intact immunoglobulin molecules and portions of an 
immunoglobulin molecule, including those portions known in 
5 the art as Fab, Fab 1 , F(ab') 2 and F(v) . 

Antibody Combining Site ; An antibody combining site 
is that structural portion of an antibody molecule com- 
prised of a heavy and light chain variable and hypervari- 
able regions that specifically binds (immunoreacts with) 

10 an antigen. The term immunoreact in its various forms 
means specific binding between an antigenic determinant- 
containing molecule and a molecule containing an antibody 
combining site such as a whole antibody molecule or a 
portion thereof. 

15 Monoclonal Antibody ; The phrase monoclonal antibody 

in its various grammatical forms refers to a population of 
antibody molecules that contains only one species of anti- 
body combining site capable of immunoreact ing with a 
particular antigen. A monoclonal antibody thus typically 

20 displays a single binding affinity for any antigen with 
which it immunoreacts. A monoclonal antibody may there- 
fore contain an antibody molecule having a plurality of 
antibody combining sites, each immunospecif ic for a 
different antigen, e.g., a bispecific monoclonal antibody. 

25 Upstream ; In the direction opposite to the direction 

of DNA transcription, and therefore going from 5 1 to 3 1 on 
the non-coding strand, or 3 1 to 5 1 on the mRNA. 

Downstream ; Further along a DNA sequence in the 
direction of sequence transcription or read out, that is 

3 0 traveling in a 3 1 - to 5 1 -direction along the non-coding 
strand of the DNA or 5 1 - to 3' -direction along the RNA 
transcript. 

Cistron ; Sequence of nucleotides in a DNA molecule 
coding for an amino acid residue sequence. 
3 5 Stop Codon ; Any of three codons that do not code for 

an amino acid, but instead cause termination of protein 
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synthesis. They are UAG, UAA and UGA. Also referred to 
as a nonsense or termination codon. 

Leader Polypeptide : A short length of amino acid 
sequence at the amino end of a protein, which carries or 
5 directs the protein through the inner membrane and so 
ensures its eventual secretion into the periplasmic space 
and perhaps beyond. The leader sequence peptide is 
commonly removed before the protein becomes active. 

Reading Frame ; Particular sequence of contiguous 
10 nucleotide triplets (codons) employed in translation. The 
reading frame depends on the location of the translation 
initiation codon. 

Inside Primer : An inside primer is a polynucleotide 
that has a priming region located at the 3 1 terminus of 
15 the primer which typically consists of 15 to 3 0 nucleotide 
bases. The 3' terminal-priming portion is capable of 
acting as a primer to catalyze nucleic acid synthesis. 
The 5 1 -terminal priming portion comprises a non-priming 
portion. 

20 Outside Primer : An outside primer comprises a 3 1 - 

terminal priming portion and a portion that may define an 
endonuclease restriction site which is typically located 
in a 5 '-terminal non-priming portion of the outside 
primer . 

25 Fusion Polynucleotide Amplification : refers to in 

vitro techniques of generating a multiple complementary 
copies of a nucleic acid template which comprises nucleo- 
tide sequences which have been randomly combined to give 
a combined nucleic sequence. These techniques typically 

3 0 employ complementary primers which hybridize to the 
template and are extended in a primer extension reaction. 
The polyumerase chain reaction (PCR) techniques described 
herein comprise a preferred method of nucleotide sequence 
amplifications. Generation and amplification of a 

3 5 combined nucleotide sequence using fusion PCR is further 
described herein. 
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10 



vector: As used herein, the term "vector" refers to 
a nucleoid molecule capable to transporting between 
different genetic environments another nuclexc add to 
which it has been operatively linked. One type of pre- 
ferred vector is an episome, i.e., a nucleic acid molecule 
capable of extra-chromosomal replication. Other suitable 
vectors include plasmid and cosmid vectors and phage, 
especially bateriophage such as lambda. Preferred vectors 
are those capable of autonomous replication and/or expres- 
sion of nucleic acids to which they are linked. Vectors 
capable of directing the expression of genes to which they 
are operatively linked are referred to herem as 
"expression vectors" . 



15 



20 



25 



30 



B. Methods , m . allv 

Until this invention, genetic engineers typically 
dealt with the expression of a single gene or family (or 
population) of genes, one at a time. The expression of a 
family of genes in a vector is generally referred to as a 
-gene library." Each member of the library will normally 
contain a different gene or DNA sequence. However, the 
vector portion of such a vector-gene fusion is typxcally 
identical from member to member. (Maniatis et a±. , 
suera) . individual members within the library may often 
he, and typically are, amplified before —to 
identify and isolate a desired member. Amplxf xcatxon 
occurs so that each library member grows as a bacterxal 
colony (for plasmid libraries) or phage plagues (for 
bacteriophage libraries, such as lambda) . These amplxf xed 
.embers are usually referred to as "clones," since each 
colony or plaque is made up of many identxcal host cells 

or phage particles. _ 

The search for a particular clone containing a single 
gene or DNA sequence of interest can be accomplished in 
m any different ways. The clone may be identified because 
its vector-gene specifically hybridizes with a nucleic 
acid probe. It may also be identified by expression of an 
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RNA species that can be identified, for example by nucleic 
acid hybridization. The RNA species may, furthermore, be 
translated into a protein, typically by the host cell, 
that may be identified, for example, by reactivity with an 
5 antibody probe. Alternatively, the protein may be recog- 
nized because it binds a substrate, or catalyzes a 
reaction, or allows the host cell to survive under 
selective conditions, and so on. 

Described herein are libraries in which two or more 
10 families (or populations) of genes are expressed in a 
vector or a host cell in such a way that the gene combi- 
nations are randomly represented and subsequently detected 
on the basis of some property or characteristic in the 
event that a particular combination of one member from a 
15 first gene family and one gene from a one or more other 
gene families are combined in a vector host cell. For 
example, in the general case if there are »i" members of 
the gene family "A" and »j" members of the gene family 
"B«, there will be (i) x (j) combinations of selected gene 
2 0 members A and B in the randomly created vector-gene 
population. if there are three gene families, A, B, and 
C, and a vector is made containing one member from each of 
the three gene families, the total number of combinations 
of genes will be the product of the number of A genes 
25 times the number of b genes times the number of c genes. 
Thus, methods are provided wherein at least two genes may 
randomly be combined, preferably on the same vector 
molecule, having been identified within a population of 
vectors containing other combinations of different genes 
30 from the same two or more gene families. This approach 
may be broadly accomplished by means other than recombi- 
nation, for example, the use of a vector having at least 
two independent insertion sites for two foreign genes or 
inserting in a vector a nucleotide sequence comprising 
1 5 nucleotide sequences from each gene family. The recombi- 
nation of at least two separate library populations to 
make a combinatorial population, for example, using a 
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common restriction site or site-directed recombination 
systems, is also contemplated. 

Thus, in addition to the above-described methods, the 
invention also provides for vectors having characteristics 
5 and sequences useful for the preparation of combinatorial 
vectors encoding random DNA sequences from two or more 
gene families. Such vectors include plasmids and phage 
containing common restriction sites or sequences enabling 
the in vivo recombination of said DNA sequences from said 

10 gene families. 

The flp site-specific recombination of cervisiae 
has been described in Cox, Chapter 13 in "Genetic Recombi- 
nation," eds. R. Kucherlapati and G. Smith (American 
Society for Microbiology 1988) . Within a sixty-five bp 

15 region identified as the recombination site and designated 
FRT (flp recombination target), there are several promi- 
nent structural features. The most important are a set of 
three bp repeats. The second and third repeats are separ- 
ated by one bp and are in the same orientation. The first 

2 0 repeat is inverted with respect to the other two and is 

separated from the second repeat by an eight bp spacer. 
The first repeat also has a one bp mismatch relative to 
the first two. Deletion analysis has demonstrated that 
the third repeat is unnecessary for recombination in 
25 vitro, although it may have a slight effect on the reac- 
tion in vivo . Additional deletions indicate that most, 
but not all, of the first and second repeats (those 
flanking the spacer) are required. While deletion of 
three bp from the distal ends of one or both of these 

3 0 repeats has no detectable effect on the reaction, further 

deletion leads to a gradual reduction in site function, 
with complete loss of site function occurring (in vitro ) 
with deletions of eight bp or more from either end. The 
minimal site required for a full function in vitro is 
3 5 therefore relatively small (approximately 28 bp including 
the spacer and the proximal 10 bp of each flanking 
repeat) . Accordingly, it will be seen that the full, 
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intermediate, or minimal FRT sequences can be utilized to 
accomplish flp-mediated site-specific recombination. 

The lambda phage attachment site is responsible for 
integration of lambda into the host chromosome. It also 
5 acts as a hot spot of recombination and lytic crosses 
between wild lambda chromosomes. As in lambda, in PI 
phage a site-specific cross over site, loxP acts as a hot 
spot of recombination. This site is recognized by the Pi 
ere protein, a known site-specific protein. The site- 

10 specific recombination system is responsible for the rare 
integration of PI into the host chromosome. The cre-lox 
system of bacteriaphage PI is also useful for the site- 
specific recombination contemplated by the invention 
described and claimed herein. 

15 A transposon can jump from one vector to another 

vector or from a vector to a bacterial chromosome. 
Different transposons having different inverted repeat 
sequences and carrying, for example, different drug- 
resistance genes, can be used to carry out the desired 

2 0 random combination of genes as described herein either in 

vivo or in vitro . The transposon may, but need not, also 
contain a sequence encoding the transposase enzyme which 
catalyzes the "hop." Various suitable transposon systems 
have been described in the literature. (See, Mobile DNA, 
25 Douglas E. Berg and Martha M. Howe, eds., American Society 
for Microbiology, Washington, D.C., 1989). One suitable 
transposon system is the gamma-delta transposon system 
which has been isolated from E. Coli. 

Thus, in addition to restriction digestion and 

3 0 ligation, use of flp type recombination systems, and 

homologous recombination, a transposon system can also be 
used to integrate a light (or heavy) antibody chain clone, 
into a heavy (or light) antibody chain clone. For 
example, this can be accomplished by flanking the light 
35 chain expression and cloning region with transposon 
terminal sequences. A library constructed in this light 
chain vector could be used to co-infect bacteria with 
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clones from the heavy chain library. The light chain 
inserts between the terminal sequences would hop from the 
light chain lambda phage vector into other DNA sequences 
in the presence of transposase activity. Selection for 
5 hopping into the heavy chain clone can be accomplished by 
placing a selectable marker within the light chain, posi- 
tioned between the transposon hopping sequences. 
Subsequently, phage recovered from the co-infected culture 
is plated with a strain enabling selection for the heavy 

10 chain vector and for the light chain marker gene. Because 
this second plating is performed under conditions of a 
high cell to phage ratio, only one lambda phage will 
typically be introduced into each cell. The lambda phage 
should grow only if the phage contains genes from both the 

15 heavy and light chain clones; most efficiently resulting 
from the transposon hop. If the hop occurs in the essen- 
tial genes of the heavy chain clone, the phage will not 
grow. Only phage containing the transposon in the proper 
position within the heavy chain will grow. A collection 

2 0 of these clones comprises a library of combinatorial heavy 

and light chain antibody clones. 

According to one aspect of the present invention, 
fusion PCR is used to generate two PCR-amplif ied DNA 
fragments, each of which have one of their ends modified 
25 by directed mispriming so that those ends share regions of 
complementarity , i.e., cohes ive termini . When the two 
fragments are mixed, denatured and reannealed in a PCR 
cycle, the cohesive termini on two strands hybridize to 
form an "overlapping" DNA duplex that is internally 

3 0 primed. The subsequent PCR cycle primer-extends the non- 

overlapping regions to form a hybride DNA molecule that is 
dicistronic. See Figure 21. 

PCR amplification methods are described in detail in 
U.S. Patent Nos. 4,863,192, 4,683,202, 4,800,159, and 
35 4,965,188, and at least in several texts including "PCR 
Technology: Principles and Applications for DNA 

Amplification", H. Erlich, ed. , Stockton Press, New York 
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(1989) ; and "PGR Protocols" A Guide to Methods and 
Applications", Innis et al., eds., Academic Press, San 
Diego, California (1990) . 

Thus, in one aspect of the present invention, fusion 
5 PCR is used to produce a library of dicistronic DNA mole- 
cules ocntaining upstream and downstream cistrons wherein 
first and second PCR amplification products are produced 
using respective first and second PCR primer pairs. The 
first PCR primer pair comprises a first polypeptide 

10 outside primer and a first polypeptide inside primer. 
Similarly, the second PCR primer pair comprises a second 
polypeptide outside primer and a second polypeptide inside 
primer. The first and second polypeptide inside primers 
contain complementary 5" -terminal sequences that allow 

15 their DNA complements to hybridize and form an internally- 
primed duplex having 3 1 -overhanging termini. The 
internally-primed duplex is then subjected to primer 
extension reaction conditions to produce a double 
stranded, dicistronic DNA having substantially blunt or 

20 blunt ends. The dicistronic DNA is then PCR amplified 
using the outside primers as a PCR primer pair. 

The dicistronic DNA molecule comprises two amino acid 
residue-coding sequences on the same strand separated by 
at least one stop codon and at least one signal sequence 

25 necessary for translation of the downstream cistron, such 
as a translation initiation codon, ribosome binding site, 
and the like. Thus, the upstream and downstream cistrons 
of the dicistronic DNA molecule are operatively linked by 
a cistronic bridge. The cistronic bridge comprises the 

3 0 genetic elements necessary to terminate translation of the 
upstream cistron and initiate translation of the down- 
stream cistron. For instance, the coding strand of the 
bridge codes for one or more stop codons, preferably two, 
in the same translational reading frame as the upstream 

3 5 cistron. The cistronic bridge coding strand preferably 
also encodes a ribosome binding site for the dowstream 
cistron located downstream from the upstream cistron' s 
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stop codon(s). Typically, the coding strand of the 
cistronic bridge will also encode a leader polypeptide 
segment in the same translational reading frame as the 
downstream cistron. When present, the nucleotide base 
5 sequence encoding the leader usually begins with an 
initiation codon located within an operative distance, 
i.e. is operatively linked, to the ribosome binding site. 

The following discussion illustrates the use of 
fusion PCR to isolate a pair of V H and V L genes from the 

10 immunoglobulin gene repertoire. This discussion is not to 
be taken as limiting, but rather as illustrating an appli- 
cation of creating a novel phenotype by combining one 
member from each of two or more families of genes. The 
illustrated method can be used with other families of 

15 conserved genes which each for one unit of a dimeric 
receptor, whether obtained directly from a natural source, 
such naive or in vivo immunized cells, or from cells or 
one or more genes that have been treated or mutagenized in 
vitro. Generally, the method, combines the following 

2 0 elements: 

1. Producing V H and V L gene repertoires. 

2. Preparing sets of outside and inside polynucleo- 
tide primers for cloning polynucleotide segments 
containing immunoglobulin V H and V L region genes. 

25 3. Preparing a library containing a plurality of 

different dicistronic DNA molecules, each containing a V H 
and a V L gene from the respective repertoires. 

4. Expressing the dicistronic DNA molecules in 
suitable host cells. 

3 0 5. Screening the polypeptides expressed by the 

dicistronic DNA molecules for the preselected activity, 
and segregating a dicistronic DNA molecules for the 
preselected activity, and segregating a dicistronic DNA 
molecule identified by the screening process. 
35 The present invention also provides a novel 

method for screening variants of a parental clone or 
clones. If the parental clone or clones contain two 
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a phenotype, then such nucleotide sequences can be altered 
to create populations of variants of such nucleotide 
sequences. If the two variant populations are coexpressed 
5 in a random fashion (that is with no correlation between 
the specific alterations made in the two different nucleo- 
tide sequences) , then a combinatorial collection of such 
nucleotide sequence variants has been created. Such com- 
binatorial collections may be screened for the presence 
10 of phenotypes that are unlike the parental clone or 
clones. Generally, the method combines the following 
elements: 

1. Replicating a clone containing a nucleotide 
sequence under conditions that allow mutations to occur. 

15 2. Replicating a second clone containing a second 

nucleotide sequence under conditions that allow mutations 
to occur. 

3 . Randomly combining and co-expressing the two 
mutated populations of nucleotide sequences. 
20 4. Screening clones containing combinations of 

mutated nucleotide sequences for phenotypes that were not 
present in either parent clone. 

Alternatively, the methods combine the following 
elements : 

25 1. Replicating at least portions of two nucleotide 

sequences contained within a single clone under conditions 
that allow mutations to occur in either nucleotide 
sequence. 

2. Allowing recombination events between the two 
3 0 nucleotide sequence populations to reassociate mutant 

nucleotide sequences to form new pairs of the two 
sequences that were not paired in the original mutated, 
replicated population. 

3 . Screening clones containing combinations of 
3 5 nucleotide sequences for phenotypes that were not present 

in the parent clone or in the mutant replicas of the 
parent clone. 
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For example, assume a parent clone containing two 
nucleotide sequences A and B is replicated under mutating 
conditions such that variant clones are formed: 

Parent: A/B 
5 Variant 1: Al/B 

Variant 2: A/Bl 

Variant 3: A2/B1 

Variant 4: A/B2 

Variant 5: A3/B 
10 However, within this mutated population, the combinations 
A1/B2, A2/B, A2/B2, A3/B1, and A3/B2 , do not occur. If 
the mutant population (including some non-mutated parent 
clones) is allowed to recombine sequences A and B and 
their variants, then combinations such as A1/B2 , A2/B etc. 
15 can be created. Such new combinations may express a 
desired phenotype that was not present in the parental or 
the variant population. 

In one aspect, the present invention is related to 
methods for tapping the immunological repertoire by 
2 0 isolating from V H -coding and V L -coding gene repertoires 
genes coding for a heterodimeric antibody receptor capable 
of binding a preselected ligand. Generally, the method 
combines the following elements: 

1. Isolating nucleic acids containing a substantial 

2 5 portion of the immunological repertoire. 

2. Preparing polynucleotide primers for cloning 
polynucleotide segments containing immunoglobulin V H and V L 
region genes. 

3. Preparing a gene library containing a plurality 

3 0 of different V H and V L genes from the repertoire. 

4 . Expressing the V H and V L polypeptides in a 
suitable host, including prokaryotic and eukaryotic hosts, 
on the same expression vector. 

5. Screening the expressed polypeptides for the 
3 5 preselected activity, and segregating a V H - and V L -coding 

gene combination identified by the screening process. 
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In one aspect, the expressed phenotype produced by 
the methods by the present invention comprises a multi- 
meric polypeptide product (i.e. a heterodimer, etc.) which 
assumes a conformation having a binding site specific for, 
5 as evidenced by its ability to be competitively inhibited, 
a preselected or predetermined ligand such as an antigen, 
enzymatic substrate and the like. In one embodiment, the 
multimeric polypeptide is an antibody that forms an anti- 
gen binding site which specifically binds to .a preselected 

10 antigen to form an immunoreaction product (complex) having 
a sufficiently strong binding between the antigen and the 
binding site for the immunoreaction product to be iso- 
lated. The antibody typically has an affinity or avidity 
is generally greater than 10 5 -M" 1 . 

15 In another embodiment, a multimeric polypeptide 

produced according to the present invention is capable of 
binding a substrate and catalyzes the formation of a 
product from the substrate. While the topology of the 
ligand binding site of a catalying multimeric polypeptide 

2 0 is probably more important for its preselected activity 

than its affinity (association constant or pKa) for the 
substrate, the useful catalytic multimeric polypeptides 
typically have an association constant for the preselected 
substrate generally greater than 10 3 M~ 1 , more usually 
25 greater than 10 5 M~ 1 or 10 6 M" 1 and preferably greater than 
10 7 M" 1 . 

Preferably the multimeric polypeptide produced 
according to the present invention is heterodimeric and is 
therefore normally comprised of two different polypeptide 

3 0 chains, which together assume a conformation having a 

binding affinity, or association constant for the pre- 
selected ligand that is different, preferably higher, than 
the affinity or association constant of either of the 
polypeptides alone, i.e., as monomers. In a particularly 
3 5 preferred aspect, one or both of the different polypeptide 
chains is derived from the variable region of the light 
and heavy chains of an immunoglobulin. Typically, poly- 
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peptides comprising the light (V L ) and heavy (V H ) variable 
regions are employed together for binding the preselected 
ligand. 

A V or V L produced by the methods of the subject 
5 invention can be active in monomer ic as well as multimeric 
forms, either homomeric or heteromeric, preferably hetero- 
dimeric. A V H and V L ligand binding polypeptide produced 
by the present invention can be advantageously combined in 
a heterodimer (antibody molecule) to modulate the activity 

10 of either or to produce an activity unique to the hetero- 
dimer. The individual ligand binding polypeptides will be 
referred to as V H and V L and the heterodimer will be 
referred to as an antibody molecule. 

However, it should be understood that a V H binding 

15 polypeptide may contain in addition to the V H , substan- 
tially all or a portion of the heavy chain constant 
region. A V L binding polypeptide may contain, in addition 
to the V L/ substantially all or a portion of the light 
chain constant region. A heterodimer comprised of a V H 

2 0 binding polypeptide containing a portion of the heavy 

chain constant region and a V L binding containing substan- 
tially all of the light chain constant region is termed a 
Fab fragment. The production of a Fab can be advantageous 
in some situations because the additional constant region 
25 sequences contained in a Fab as compared to a F v could 
stabilize the V H and V L interaction. Such stabilization 
could cause the Fab to have higher affinity for antigen. 
In addition the Fab is more commonly used in the art and 
thus there are more commercial antibodies available to 

3 0 specifically recognize a Fab. 

The individual V H and V L polypeptides may be produced 
in lengths equal or substantially equal to their naturally 
occurring lengths. However, the individual V H and V L poly- 
peptides will generally have fewer than 12 5 amino acid 
35 residues, more usually fewer than about 120 amino acid 
residues, while normally having greater than 60 amino acid 
residues, usually greater than about 95 amino acid 
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residues, more usually greater than about 100 amino acid 
residues. Preferably, the V H will be from about 110 to 
about 12 5 amino acid residues in length while V L will from 
about 95 to about 115 amino acid residues in length. 
5 The amino acid residue sequences of the polypeptides 

will vary widely, depending upon the particular idiotype 
involved. Usually, there will be at least two cysteines 
separated by from about 60 to 75 amino acid residues and 
joined by a disulfide bond. The polypeptides produced by 
10 the subject invention will normally be substantial copies 
of idiotypes of the variable regions of the heavy and/or 
light chains of immunoglobulins, but in some situations a 
polypeptide may contain random mutations in amino acid 
residue sequences in order to advantageously improve the 
15 desired activity. 

In some situations, it is desireable to provide for 
covalent cross linking of the V H and V L polypeptides, which 
can be accomplished by providing cysteine resides at the 
carboxyl termini. The polypeptide will normally be pre- 
20 pared free of the immunoglobulin constant regions, however 
a small portion of the J region may be included as a 
result of the advantageous selection of DNA synthesis 
primers. The D region will normally be included in the 
transcript of the V H . 
25 In other situations, it is desirable to provide a 

peptide linker to connect the V L and the V H to form a 
single-chain antigen-binding protein comprised of a V H and 
a V L . This single-chain antigen-binding protein would be 
synthesized as a single protein chain. Such a single- 
3 0 chain antigen binding proteins have been described by Bird 
et al., Science , 242:423-426 (1988). The design of 
suitable peptide linker regions is described in U.S. 
Patent No. 4,704,692 by Robert Landner. 

Such a peptide linker may be designed as part of the 
35 nucleic acid sequences contained in the expression vector. 
The nucleic acid sequences coding for the peptide linker 
would be between the V H and V L DNA homologs and the 
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restriction endonuclease sites used to operatively link 
the V u and V. DNA homologs to the expression vector. 

H L 

Such a peptide linker also may be coded for nucleic 
acid sequences that, are part of the polynucleotide primers 
5 used to prepare the various gene libraries. The nucleic 
acid sequence coding for the peptide linker can be made up 
of nucleic acids attached to one of the primers or the 
nucleic acid sequence coding for the peptide linker may be 
derived from nucleic acid sequences that are attached to 
10 several polynucleotide primers used to create the gene 
libraries. 

Typically the C terminus region of the V H and V L 
polypeptides will have a greater variety of the sequences 
than the N terminus and, based on the present strategy, 
15 can be further modified to permit a variation of the 
normally occurring V H and V L chains. A synthetic 

polynucleotide and be employed by vary one or more amino 
in an hypervariable region. 

1. Isolation Of A Gene Rep ertoire 

20 According to one aspect of the present invention, a 

gene repertoire useful in the methods the present inven- 
tion contains at least 10 3 , preferably at least 10 4 , more 
preferably at least 10 5 , and most preferably at least 10 7 
different consderved genes. Methods for evaulating the 

25 diversity of a repertoire of conserved genes are well 
known to one skilled in the art. 

Various well known methods can be employed to produce 
a useful gene repertoire. For example, to prepare a 
composition of nucleic acids containing a substantial 

3 0 portion of the immunological gene repertoire, a source of 
genes coding for the V H and/or V L polypeptides is required. 
Preferably the source will be heterogeneous population of 
antibody producing cells, i.e. , B lymphocytes (B cells), 
preferably rearranged B cells such as those found in the 

3 5 circulation or spleen of a vertebrate. (Rearranged B 
cells are those in which immunoglobulin gene transloca- 
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tion, i.e., rearrangement, has occurred as evidenced by 
the presence in the cell of mRNA with the immunoglobulin 
gene V, D and J region transcripts adjacently located 
thereon . ) 

5 In some cases, it is desirable to bias the repertoire 

for a preselected activity, such as by using as a source 
of nucleic acid cells (source cells) from vertebrates in 
any one of various stages of age, health and immune 
response. For example, repeated immunization of a healthy 

10 animal prior to collecting rearranged B cells results in 
obtaining a repertoire enriched for genetic material 
producing a ligand binding polypeptide of high affinity. 
See, e.g. Mullinax et al., Proc. Nat. Acad. Sci. (USA) 
87:8095-8099 (1990). Conversely, collecting rearranged B 

15 cells from a healthy animal whose immune system had not 
been recently challenged results in producing a repertoire 
that is not biased towards the production of high affinity 
V H and/or V L polypeptides. 

It should be noted the greater the genetic hetero- 

2 0 geneity of the population of cells for which the nucleic 

acids are obtained, the greater the diversity of the 
immunological repertoire that will be made available for 
screening according to the method of the present inven- 
tion. Thus, cells from different individuals of different 
25 strains, races or species can be advantageously combined 
to increase the heterogeneity (diversity) of the 
repertoire. 

Thus, in one preferred embodiment, the source cells 
are obtained from a vertebrate, preferably a mammal, which 

3 0 has been immunized or partially immunized with an anti- 

genic ligand (antigen) against which activity is sought, 
i.e., a preselected antigen. The immunization can be 
carried out conventionally. Antibody titer in the animal 
can be monitored to determine the stage of immunization 
3 5 desired, which stage corresponds to the amount of enrich- 
ment or biasing of the repertoire desired. Partially 
immunized animals typically receive only one immunization 
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and cells are collected therefrom shortly after a response 
is detected. Fully immunized animals display a peak 
titer, which is achieved with one or more repeated injec- 
tions of the antigen into the host mammal, normally at 2 
5 to 3 week intervals. Usually three to five days after the 
last challenge, the spleen is removed and the genetic 
repertoire of the spleenocytes, about 90% of which are 
rearranged B cells, is isolated using standard procedures. 
See, current Protocols in Molecula r Biology, Ausubel et 

10 al., eds., John Wiley & Sons, NY. 

Nucleic acids coding for V H and V L polypeptides can be 
derived from cells producing IgA, IgD, IgE, IgG or IgM, 
most preferably from IgM and IgG, producing cells. 

Methods for preparing fragments of genomic DNA from 

15 which immunoglobulin variable region genes can be cloned 
as a diverse population are well known in the art. See 
for example Herrmann et al., Methods In Enzvmol . , 152:180- 
183, (1987); Frischauf, Methods In Enzvmol . . 152:180-190 
(1987); Frischauf, Methods In Enzvmol . . 152:190-199 

20 (1987); and DiLella et al., Methods In Enzvmol . , 152:199- 
212 (1987) . (The teachings of the references cited herein 
are hereby incorporated by reference.) 

The desired gene repertoire can be isolated from 
either genomic material containing the gene expressing the 

25 variable region or the messenger RNA (mRNA) which repre- 
sents a transcript of the variable region. The difficulty 
in using the genomic DNA from other than non-rearranged B 
lymphocytes is in juxtaposing the sequences coding for the 
variable region, where the sequences are separated by 

3 0 intervening regions. The DNA fragment (s) containing the 
proper variable regions must be isolated, the intervening 
regions excised, and the variable regions then spliced in 
the proper order and in the proper orientation. For the 
most part, this will be difficult, so that the alternative 

35 technique employing rearranged B cells will be the method 
of choice because the V, D and J immunoglobulin gene 
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regions have translocated to become adjacent, so that the 
sequence is continuous for the variable regions. 

Where mRNA is utilized the cells will be lysed under 
RNase inhibiting conditions. In one embodiment, the first 
5 step is to isolate the total cellular mRNA by hybridiza- 
tion to an oligo-dT cellulose column. The presence of 
mRNAs coding for the heavy and/or light chain polypeptides 
can then be assayed by hybridization with DNA single 
strands of the appropriate genes. Conveniently, the 
10 sequences coding for the constant portion of the V H and V L 
can be used as polynucleotide probes, which sequences can 
be obtained from available sources. See for example, 
Early and Hood, Genetic Engineering , Setlow and 
Hollaender, eds., Vol. 3, Plenum Publishing Corporation, 
15 New York, (1981), pages 157-188; and Kabat et al., 
Sequences of Immunological Interest , National Institutes 
of Health, Bethesda, MD, (1987) . Exemplary methods for 
producing V H and V L gene repertoires are described in PCT 
Application No. PCT/US 90/02836 (International Publication 
20 No. WO 90/14430) . 

In preferred embodiments, the preparation containing 
the total cellular mRNA is first enriched for the presence 
of V H and/or V L coding mRNA. Enrichment is typically 
accomplished by subjecting the total mRNA preparation or 
25 partially purified mRNA product thereof to a primer 
extension reaction employing a polynucleotide synthesis 
primer of the present invention. 

According to another aspect of the present invention, 
a gene repertoire may be generated from one or a few 
3 0 nucleotide sequences by replicating those sequences under 
mutagenesis conditions so that a plurality of different 
nucleotide sequences or genes may be generated. Suitable 
mutagenesis conditions are known to those skilled in the 
art. 
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2 . Preparation Of Polynucl eotide Primers 
The term "polynucleotide" as used herein in reference 
to primers, probes and nucleic acid fragments or segments 
to be synthesized by primer extension is defined as a 
5 molecule comprised of two or more deoxyribonucleotides or 
ribonucleotides, preferably more than 3. Its exact size 
will depend on many factors, which in turn depends on the 
ultimate conditions of use. 

The term "primer" as used herein refers to a poly- 

10 nucleotide whether purified from a nucleic acid 
restriction digest or produced synthetically, which is 
capable of acting as a point of initiation of synthesis 
when placed under conditions in which synthesis of a 
primer extension product which is complementary to a 

15 nucleic acid strand is induced, i.e., in the presence of 
nucleotides and an agent for polymerization such as DNA 
polymerase, reverse transcriptase and the like, and at a 
suitable temperature and pH. The primer is preferably 
single stranded for maximum efficiency, but may alterna- 

2 0 tively be stranded. If double stranded, the primer is 
first treated to separate its strands before being used 
to prepare extension products. Preferably, the primer is 
a polydeoxyribonucleotide. The primer must be suffi- 
ciently long to prime the synthesis of extension products 

25 in the presence of the agents for polymerization. The 
exact lengths of the primers will depend on many factors, 
including temperature and the source of primer. For 
example, depending on the complexity of the target 
sequence, a polynucleotide primer typically contains 15 to 

30 25 or more nucleotides, although it can contain fewer 
nucleotides. Short primer molecules generally require 
cooler temperatures to form sufficiently stable hybrid 
complexes with template. 

The primers used herein are selected to be 

35 "substantially" complementary to the different strands of 
each specific sequence to be synthesized or amplified. 
This means that the primer must be sufficiently comple- 
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mentary to nonrandomly hybridize with its respective 
template strand. Therefore, the primer sequence need not 
reflect the exact sequence of the template. For example, 
a non-complementary nucleotide fragment can be attached to 
5 the 5 1 end of the primer, with the remainder of the primer 
sequence being substantially complementary to the strand. 
Such noncomplementary fragments typically code for an 
endonuclease restriction site. Alternatively, noncomple- 
mentary bases or longer sequences can be interspersed into 

10 the primer, provided the primer sequence has sufficient 
complementarily with the sequence of the strand to be syn- 
thesized to amplified to non-randomly hybridize therewith 
and thereby form an extension product under polynucleotide 
synthesizing conditions. 

15 Primers of the present invention may also contain a 

DNA-dependent RNA polymerase promoter sequence or its 
complement. See for example, Krieg et al., Nucleic Acids 
Research, 12:7057-70 (1984); Studier et al . , J . Mol . 
Biol. , 189:113-130 (1986); and Molecular Cloning: A 

20 Laboratory Manual, Second Edition . Maniatis et al. , eds. , 
Cold Spring Harbor, NY (1989) . 

When a primer containing a DNA-dependent RNA poly- 
merase promoter is used, the primer is hybridized to the 
polynucleotide strand to be amplified and the second 

25 polynucleotide strand of the DNA-dependent RNA polymerase 
promoter is completed using an inducing agent such as E . 
coli DNA polymerase I, or the Klenow fragment of E. coli 
DNA polymerase. The starting polynucleotide is amplified 
by alternating between the production of an RNA poly- 

3 0 nucleotide and DNA plynucleotide. 

Primers may also contain a template sequence or 
replication initiation site for a RNA-directed polymerase. 
Typical RNA-directed RNA polymerase include the QB repli- 
case described by Lizardi et al. Biotechnology . 6:1197- 

3 5 1202 (1988) . RNA-directed polymerases produce large 
numebrs of RNA strands from a small number of template RNA 
strands that contain a template sequence or replication 
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initiation site. These polymerases typically give a one 
million-fold amplification of the template strand, as has 
been described by Kramer et al., J- Mol. Biol.. 89:7819- 
736 (1974). 

5 The polynucleotide primers can be prepared using any 

suitable method, such as, for example, the phosphotriester 
on phosphodiester methods see Narang et al . , Meth. 
Enzvmol . , 68:90, (1979); U.S. Patent No. 4,356,270; and 
Brown et al., Meth. Enzvmol. . 68:109, (1979). 

10 The choice of a primer's nucleotide sequence depends 

on factors such as the distance on the nucleic acid from 
the region coding for the desired receptor, its hybrid- 
ization site on the nucleic acid relative to any second 
primer to be used, the number of genes in the repertoire 

15 it is to hybridize to, and the like. 

(a) Primers for Producing V„ and V L DNA Homoloas 

V H and V L gene repertoires can be separately prepared 
prior to their use in the methods of the present inven- 
tion. Repertoire preparation is typically done by primer 

2 0 extension (or other in vitro amplif icaiton method) , 

preferably by primer extension in a PCR format. 

For example, to produce V H -coding DNA homologs by 
primer extension, the nucleotide sequence of a primer is 
selected to hybridize with a plurality of immunoglobulin 
25 heavy chain genes at a site substantially adjacent to the 
V H -coding region so that a nucleotide sequence coding for 
a functional (capable of finding) polypeptide is obtained. 
To hybridize to a plurality of different V H -coding nucleic 
acid strands, the primer must be a substantial complement 

3 0 of a nucleotide sequence conserved among the different 

strands. Such sites include nucleotide sequences in the 
constant region, any of the variable region framework 
regions, preferably the third framework region, leader 
region, promoter region, J region and the like. 
35 if the V H -coding and V L -coding DNA homologs are to be 

produced by polymerase chain reaction (PCR) amplification, 
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two primers must be used for each coding strand of nucleic 
acid to be amplified. The first primer becomes part of 
the nonsense (minus or complimentary) strand and hybrid- 
izes to a nucleotide sequence conserved among V H (plus) 
5 strands within the repertoire. To produce V H coding DNA 
homologs, first primers are therefore chosen to hybridize 
to (i.e. be complementary to) conserved regions within the 
J region, CHI region, hinge region, C H 2 region, or C H 3 
region of immunoglobulin genes and the like. To produce 

10 a V L coding DNA homolog, first primers are chosen to 
hybridize with (i.e. be complementary to) a conserved 
region with the J region or constant region of immuno- 
globulin light chain genes and the like. Second primers 
become part of the coding (plus) strand and hybridize to 

15 a nucleotide sequence conserved among minus strands. To 
produce the V H -coding DNA homologs, second primers are 
therefore chosen to hybridize with a conserved nucleotide 
sequence at the 5 1 end of the V H -coding immunoglobulin gene 
such as in that area coding for the leader or first frame- 

2 0 work region. It should be noted that in the amplification 

of both V H - and V L -coding DNA homologs, the conserved 5 1 
nucleotide sequence of the second primer can be comple- 
mentary to a sequence exogenously added using terminal 
deoxynucleotidyl transferase as described by Loh et al., 
25 Science 243 :217-220 (1989). One or both of the first and 
second primers can contain a nucleotide sequence defining 
an endonuclease recognition site. The site can be heter- 
ologous to the immunoglobulin gene being amplified and 
typically appears at or near the 5' end of the primer. 

30 (b) Inside and Outside Primers 

In one embodiment, the present invention utilizes a 
set of polynucleotides that form inside primers comprised 
of an upstream inside primer and a downstream inside 
primer. Each of the inside primers has a priming region 

3 5 located at the 3 1 -terminus of the primer. The priming 

region is typically the S'-most ( 3 1 -terminal ) 15 to 30 
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nucleotide bases. The 3 1 -terminal priming portion of each 
inside primer is capable of acting as a primer to catalyze 
nucleic acid synthesis, i.e., initiate a primer extension 
reaction off its 3" terminus. One or both of the inside 
5 primers is further characterized by the presence of a 5'- 
terminal (5 f -most) non-priming portion, i.e., a region 
that does not participate in hybridization to repertoire 
template. 

In fusion PCR, each inside primer works in 
10 combination with an outside primer to amplify a target 
nucleic acid sequence. The choice of PCR primer pairs for 
use in fusion PCR as described herein is governed by the 
same considerations as previously discussed for choosing 
PCR primer pairs useful in producing gen repertoires. 
15 That is, the primers have a nucleotide sequence that is 
complementary to a sequence conserved in the repertoire. 
Useful V L and V H inside priming sequences are shown in 
Tables 1 and 2, respectively, below. 

Table 1 

20 3 1 Priming Portions of Various Inside V L Primers 



Seq. 








Id. No. 








(I) 1 


5' 


GTGATGACCCACTCTCC 3 


i 


(2) 


5 ' 


GTGATGACCCAGTCTCCA 


3 


(3) 


5 ' 


GTTGTGACTCAGGAATCT 


3 


(4) 


5' 


GTGTTGACGCAGCCGCCC 


3 


(5) 


5 • 


GTGCTCACCCAGTCTCCA 


3 


(6) 


5 • 


CAGATGACCCAGTCTCCA 


3 


(7) 


5 ' 


GTGATGACCCAGACTCCA 


3 


(8) 


5 ' 


GTCATGACCCAGTCTCCA 


3 


(9) 


5 » 


TTGATGACCCAAACTCAA 


3 


(10) 


5 ' 


GTGATAACCCAGGATGAA 


3 



1 Nucleotides sequences 1-10 are unique 5 1 primers for 
the amplification of kappa light chain variable 
3 5 regions. 
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Table 2 

3 ! Priming Portions of Various Inside V n Primers 

Seq. 

Id, No, 

5 (ll) 1 5' ACAAGATTTGGGCTC 3 1 

( 12 ) 2 5 1 TGGGGTTTTGAGCTC 3 1 

(13) 3 5* GAGACAGTGACCGGGTTCCTTGGCCCCA 3' 

(14) 4 5 1 TGGAATGGGCACATGCAG 3« 

(15) 5 5» TTATCATTTACCCGGAGA 3' 

10 (16) 6 5 1 AACGGTAACAGTGGTGCCTTGGCCCCA 3 1 

( 17 ) 7 5 1 ACAATCCCTGGGCACAAT 3 ■ 

(18) 8 5 1 CACCTTGGTGCTGCTGGC 3« 

( 19 ) 9 5 1 ACAACCACAATCCCTGGGCACAATTTT 3 1 

(20) 10 5 1 ACAATCCCTGGGCACAAT 3 1 

15 (21) 11 5* GAGTTCACTAGTTGGGCACGGTGGGCA 3 1 

1 Unique 3 1 primer for human IgGl, 2, 3, and 4 F.2d. 
Unique 3' primer for human V H amplification. 
3 1 primer for amplifying human heavy chain variable 
regions. 

20 4 3 1 primer for amplifying the Fd region of mouse IgM. 

5 3 1 primer located in the CH3 region of human IgGl to 
amplify the entire heavy chain. 

6 Unique 3 1 primer for amplification of mouse F v 

7 Unique 3 1 primer for amplification of mouse IgGl Fd. 
25 8 Unique 3 1 primer for amplification of VH including 

part of the mouse gamma 1 first constant region. 
9 Unique 3 1 primer for amplification of VH including 
part of mouse gamma 1 first constant region and hinge 
region. 

30 10 3' primer for amplifying mouse Fd including part of 
the mouse IgG first constant region and part of the 
hinge region. 

11 3' primer for amplifying human IgGl Fd including part 
of the human IgG first constant region and part of 
3 5 the hinge region including the two cysteines which 

create the disulfide bridge for producing Fab 1 2 (the 
primer corresponds to Kabat number 241QQ to 247) . 
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A preferred set of inside primers used herein has 
primers with complementary 5 1 -terminal non-priming 
regions, the complementary strands of which are capable of 
hybridizing to each other to form a duplex with 3 1 over- 
5 hangs. The duplex encodes all or part of a double 
stranded cistronic bridge. That is, if the 3 1 overhangs 
of the duplex are filled in with complementary bases so as 
to define a double stranded DNA extending from the 3'- 
terminus of one of the inside primers to the 3 "-terminus 

10 of the other of the inside primers, that double stranded 
DNA segment forms a sequence of nucleotides that opera- 
tively links the upstream and downstream cistrons for 
polycistronic expression. Thus, while each of the inside 
primers in a set contains only a portion of the sequence 

15 information necessary to form the double stranded 
cistronic bridge, the two inside primers in combination 
encode both the plus and minus strands of all or part of 
the bridge. 

For example, one inside upstream primer can have a 
20 sequence that forms a portion of the plus strand of the 
bridge, and the other inside primer encodes the sequence, 
through complementarity, of the downstream portion of the 
plus strand. 

In a preferred embodiment, the plus strand of the 
25 cistronic bridge contains, in the translational reading 
frame and from an upstream position to a downstream 
position, sequences coding for (i) at least one stop 
codon, preferably two, in the same reading frame as the 
upstream cistron, (ii) a ribosome binding site, and (iii) 
3 0 a polypeptide leader, the translation initiation codon of 
which is in the same reading frame as the downstream 
cistron. The stop codon is present to terminate transla- 
tion of the upstream cistron. The ribosome binding site 
is present to initiate translation of the downstream 
3 5 cistron from the polycistronic mRNA. 

The predicted amino acid residue sequences of two 
pelB gene product variants from Erwinia Carotova are shown 
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in Table 3. Lei, et al., supra . Amino Acid residue 
sequences for other leaders from E. coli useful in this 
invention are also listed in Table 3. Oliver, In 
Neidhart , F . C . ( ed . ) , Escherichia coli and Salmonella 
5 Typhimurium . American Society for Microbiology, 
Washington, D. C. , 1:56-69 (1987). These regions for the 
heavy chain are contained in the modified ImmunoZAP H 
expression vector. Mullinax, et al., Proc . Natl . Acad . 
Sci. . USA . 87:8095-8099 (1990). 



10 Table 3 

Leader Sequences 
Seq. 

Id. No. Type Amino Acid Residue Sequence 

(22) pelB 1 MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeuLeu 
15 LeuLeuAlaAlaGlnProAlaGlnProAlaMetAla 

(23) pelB 2 MetLysSerLeuIleThrProIleAlaAlaGlyLeuLeu 

LeuAl aPheS er GlnTy r S er LeuAl a 

( 24 ) MalE 3 MetLysIleLysThrGlyAlaArglleLeiaAlaLeuSer 

AlaLeuThrThrMetMetPheSerAlaSerAlaLeuAla 
20 Lyslle 

(25) OmpF 3 MetMetLysArgAsnlleLeuAlaVallleValProAla 

LeuLeuValAlaGlyThrAlaAsnAlaAlaGlu 

(26) PhoA 3 MetLysGlnSerThrlleAlaLeuAlaLeuLeuProLeu 

LeuPheThrProValThrLysAlaArgThr 
25 ( 27 ) Bla 3 MetSerlleGlnHisPheArgValAlaLeuIleProPhe 

PheAlaAlaPheCysLeuProValPheAlaHisPro 
(28) LamB 3 MetMetlleThrLeuArgLysLeuProLeuAlaValAla 

ValAlaAlaGlyValMetSerAlaGlnAlaMetAlaVal 
Asp 

3 0 ( 29 ) Lpp 3 MetLysAlaThrLysLeuValLeuGlyAlaVallleLeu 

GlySerThrLeuLeuAlaGlyCysSer 



pelB from Erwinia carotovora gene 
pelB from Erwinia carotovora EC 16 gene 
3 5 3 leader sequences from E. coli 
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To achieve high levels of gene expression in E. coli, 
it is necessary to use not only strong promoters to gener- 
ate large quantities of mRNA, but also ribosome binding 
sites to ensure that the mRNA is efficiently translated. 
5 In E . coli , the ribosome binding site includes an initi- 
ation codon (AUG) and a sequence 3- nucleotides long 
located 3 11 nucleotides upstream from the initiation 
codon [Shine et al., Nature , 254:34 (1975)]. The 
sequence, AGGAGGU, which is called the Shine-Dalgarno (SD) 

10 sequence, is complementary to the 3 1 end of E. coli 16S 
mRNA. Binding of the ribosome to mRNA and the sequence at 
the 3 1 end of the mRNA can be affected by several factors: 
(i) The degree of complementarity between the SD 
sequence and 3 1 end of the 16S tRNA. 

15 (ii) The spacing and possibly the DNA sequence lying 

between the SD sequence and the AUG [Roberts et al., Proc. 
Natl. Acad, Sci. USA , 76:760 (1979A) ; Roberts et al . , 
Proc. Natl. Acad. Sci. USA , 76:5596 (1979B) ; Guarente et 
al., Science . 209:1428 (1980); and Guarente et al., Cell, 

20 20:543 (1980) .] Optimization is achieved by measuring the 
level of expression of genes in plasmids in which this 
spacing is systematically altered. Comparison of differ- 
ent mRNAs shows that there are statistically preferred 
sequences from positions -2 0 to +13 (where the A of the 

25 AUG is position 0) [Gold et al., Annu. Rev. Mic robiol., 
35:365 (1981)]. Leader sequences have been shown to 
influence translation dramatically (Roberts et al. 1979 a, 
b supra ) . 

(iii) The nucleotide sequence following the AUG, 
30 which affects ribosome binding [Taniguchi et al., J . Mol. 
Biol. , 118:533 (1978)]. 

Useful ribosome binding sites are shown in Table 4 
below. 
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Table 4 

Ribosome Binding Sites* 

Seq. 

Id. No. 



5 1. (30) 5" AAUCU UGGAGG CUUUUU UAUG GUUCGUUCU 

2. (31) 5 1 UAAC UAAGGAU GAAAUGC AUGU CUAAGACA 

3. (32) 5 1 UCC UAGGAGGUU UGACC UAUG CGAGCUUUU 

4. (33) 5 1 AUGUAC UAAGGAGGUU G UAUGG AACAACGC 



* Sequences of initiation regions for protein 
10 synthesis in four phage itiRNA molecules are underlined. 
AUG = initiation codon (double underlined) 

1. « phage 0X174 gene-A protein 

2. = Phage Q0 replicase 

3 . = Phage R17 gene-A protein 

15 4. = Phage lambda gene-cro protein 

It is preferred that the complementary (overlapping) 
region of the inside primers and the priming portion of 
the inside primers have about the same denaturation 
temperature, Td. The Td of a sequence can be estimated by 

20 the following formula: Td = 4(C+G) + 2 (A+T) , where C, G, 
A and T represent the respective number of cytosine, 
guanine, adenine and thymine bases in the sequence. A Td 
for the above- identified hybridizing region of about 45- 
55 °C, preferably about 50 °C, is preferred. Typically, 

25 overlapping regions in the range of about 15 to 20 
nucleotides works well in conjunction with the priming 
regions in the range of 15-3 0 nucleotides. 

The set of outside primers forms the termini of the 
dicistronic DNA molecule. The set of outside primers 

3 0 comprises an upstream outside primer and a downstream 
outside primer. The outside primers each comprise a 3 1 - 
terminal priming portion, and preferably a portion that 
defines an endonuclease restriction site. When present, 
the restriction site-defining portion is typically located 
35 in a 5 1 -terminal non-priming portion of the outside 
primer. The restriction site defined by the upstream 
outside primer is typically chosen to be one recognized by 
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a restriction enzyme that does not recognize the restric- 
tion site defined by the downstream outside primer, the 
objective being to be able to produce a dicistronic DNA 
having cohesive termini that are non-compelementary to 
5 each other and thus allow directional insertion into a 
vector. 

Useful outside primer sequences are shown in Tables 
5 and 6 below. 



Table 5 






Outside V n 


Primers 


Seq. 






Id. No. 






(34) 1 


5 


' AGGTCCAGCTGCTCGAGTCTGG3 ' 


(35) 


5 


* AGGTCCAGCTGCTCGAGTCAGG3 ' 


(36) 


5 


• AGGTCCAGCTTCTCGAGTCTGG3 » 


(37) 


5 


• AGGTCCAGCTTCTCGAGTCAGG3 • 


(38) 


5 


• AGGTCCAACTGCTCGAGTCTGG3 • 


(39) 


5 


• AGGTCCAACTGCTCGAGTCAGG3 ' 


(40) 


5 


• AGGTCCAACTTCTCGAGTCTGG3 • 


(41) 


5 


• AGGTCCAACTTCTCGAGTCAGG3 • 


(42) 2 


5 


• AGGTGCAGCTGCTCGAGTCTGG3 « 


(43) 


5 


•AGGTGCAGCTGCTCGAGTCGGG3 1 


(44) 


5 


1 AGGTGCAACTGCTCGAGTCTGG3 1 


(45) 


5 


•AGGTGCAACTGCTCGAGTCGGG3 1 



25 1 Nucleotide sequences 21-28 are unique 5 1 primers for 

the amplification of mouse V H genes. 
2 Nucleotide sequences 29-32 are unique 5' primers for 
amplification of nucleic acids coding for human 
variable regions. 



30 Table 6 

Outside V L Primers 

Seq. 

Id. No. 

(46) 1 5« ACGTCTAGATTCCACCTTGGTCCC 3» 

35 (47) 2 5 1 TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA 3 f 
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(48) 3 


5 ' 


GCATTCTAGACTATTAACATTCTGTAGGGGC 3 ' 




(49) 4 


5 ' 


GCAGCATTCTAGAGTTTCAGCTCCAGCTTGCC 3 • 




(50) 5 


5 « 


CCGCCGTCTAGAACACTCATTCCTGTTGAAGCT 3 


t 


(51) 6 


5 ' 


CCGCCGTCTAGAACATTCTGCAGGAGACAGACT 3 


1 


(52) 7 


5' 


GCGCCGTCTAGAATTAACACTCATTCCTGTTGAA 


3 • 


(53) 8 


5' 


GCCGCTCTAGAACACTCATTCCTGTTGAA 3 ' 




(54) 9 


5' 


TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA 


3 ' 


(55) 10 


5' 


GCATTCTAGACTATTATGAACATTCTGTAGGGGC 


3 • 



9 
10 



3 1 primer for amplifying human kappa chain variable 
regions . 

3 1 primer in human kappa light chain constant region. 
3 1 primer in human lambda light chain constant 
region. 

Unique 3 1 primer for amplification of kappa light 
chain variable regions. 

Unique 3 1 primer for mouse kappa light chain 

amplification including the constant region. 

Unique 3 1 primer for mouse lambda light chain 

amplification including the constant region. 

Unique 3 1 primer for amplification of kappa light 

chain . 

Unique 3 1 primer for amplification of mouse kappa 
light chain. 

Unique 3 1 primer for kappa V L amplification. 

Unique 3 1 primer for human, mouse and rabbit lambda 

V L amplification. 



3 . Preparing a Gene Library 

The strategy used for cloning, i.e., substantially 
reproducing the V H and/or V L genes contained within the 

3 0 isolated repertoire will depend, as is well known in the 
art, on the type, complexity, and purity of the nucleic 
acids making up the repertoire. Other factors include 
whether or not the genes are contained in one or a 
plurality of repertoires or populations and whether or not 

35 they are to be amplified and/or mutagnized. 
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a . Preparing V „ and V L libraries 

In one strategy, the object is to clone the V H - and/ or 
V L -coding genes from a repertoire comprised of polynucleo- 
tide coding strands, such as mRNA and/or the sense strand 
5 of genomic DNA. If the repertoire is in the form of 
double stranded genomic DNA, it is usually first dena- 
tured, typically by melting, into single strands. The 
repertoire is subjected to a first primer extension 
reaction by treating (contacting) the repertoire with a 

10 first polynucleotide synthesis primer having a preselected 
nucleotide sequence. The first primer is capable of ini- 
tiating the first primer extension reaction by hybridizing 
to a nucleotide sequence, preferably at least about 10 
nucleotides in length and more preferably at least about 

15 2 0 nucleotides in length, conserved within the repertoire. 
The first primer is sometimes referred to herein as the 
"sense primer" because it hybridizes to the coding or 
sense strand of a nucleic acid. In addition, the second 
primer is sometimes referred to herein as the "anti-sense 

20 primer" because it hybridizes to a non-coding or anti- 
sense strand to a nucleic acid, i.e., a strand 
complementary to a coding strand. 

The PCR reaction is performed by mixing the PCR pair, 
preferably a predetermined amount thereof, with the 

25 nucleic acids of the repertoire, preferably a predeter- 
mined amount thereof, in a PCR buffer to form a first PCR 
admixture. The admixture is maintained under polynucleo- 
tide synthesizing conditions for a time period, which is 
typically predetermined, sufficient for the formation of 

3 0 a PCR reaction product, thereby producing a gene library 
containing a plurality of different V H - and/or V L -coding 
DNA homo logs. 

A plurality of first primer and/or a plurality of 
second primers can be used in each amplification, e.g., 
3 5 one species of first primer can be paired with a number of 
second primers to form several different primer pairs. 
Alternatively, an individual pair of first and second 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



56 

primers can be used. In any case, the amplification 
products of amplifications using the same or different 
combinations of first and second primers can be combined 
to increase the diversity of the gene library. 
5 In another strategy, the object is to clone the V H - 

and/or V L -coding gene from a repertoire by providing a 
polynucleotide complement of the repertoire, such as the 
anti-sense strand of genomic dsDNA or the polynucleotide 
produced by subjecting mRNA to a reverse transcriptase 

10 reaction. Methods for producing such complements are well 
known in the art. The complement is subjected to a primer 
extension reaction similar to the above-described second 
primer extension reaction, i.e., a primer extension 
reaction using a polynucleotide synthesis primer capable 

15 to hybridizing to a nucleotide sequence conserved among a 
plurality of different V H -coding gene complements. 

The primer extension reaction is performed using any 
suitable method. Generally it occurs in a buffered aque- 
ous solution, preferably at a pH of 7-9, most preferably 

2 0 about 8. Preferably, a molar excess (for genomic nucleic 

acid, usually about 10 6 :1 primer: template) of the primer is 
admixed to the buffer containing the template strand. A 
large molar excess is preferred to improve the efficiency 
of the process. 

25 The deoxy ribonucleotide triphosphates dATP, dCTP, 

dGTP, and dTTP are also admixed to the primer extension 
(polynucleotide synthesis) reaction admixture in adequate 
amounts and the resulting solution is heated to about 
90°C-100°C for about 1 to 10 minutes, preferably from 1 

3 0 to 4 minutes. After this heating period the solution is 

allowed to cool to room temperature, which is preferable 
for primer hybridization. To the cooled mixture is added 
an appropriate agent for inducing or catalyzing the primer 
extension reaction, and the reaction is allowed to occur 
35 under conditions known in the art. The synthesis reaction 
may occur at from room temperature up to a temperature 
above which the inducing agent no longer functions effi- 
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ciently. Thus, for example, if DNA polymerase is used as 
inducing agent, the temperature is generally no greater 

than about 40°C. 

The inducing agent may be any compound or system 
5 which will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 
for this purpose include, for example, E. cpli , DNA poly- 
merase I, Klenow fragment of E. cojj DNA polymerase I, T4 
DNA polymerase, other available DNA polymerases, reverse 
10 transcriptase, and other enzymes, including heat-stable 
enzymes, which will facilitate combination of the nucleo- 
tides in the proper manner to form the primer extension 
products which are complementary to each nucleic acid 
strand. Generally, the synthesis will be initiated at the 
15 3« end of each primer and proceed in the 5- direction 
along the template strand, until synthesis terminates, 
producing molecules of different lengths. There may be 
inducing agents, however, which initiate synthesis at the 
5- end and proceed in the above direction, using the same 
20 process as described above. 

The inducing agent also may be a compound or system 
which will function to accomplish the synthesis of RNA 
primer extension products, including enzymes. in 
preferred embodiments, the inducing agent may be a DNA- 
25 dependent RNA polymerase such as T7 RNA polymerase, T3 RNA 
polymerase or SP6 RNA polymerase. These polymerases 
produce a complementary RNA polynucleotide. The high turn 
overrate of the RNA polymerase amplifies the starting 
polynucleotide as has been described by Chamberlin et al. , 
30 The Enzvmes . ed. P. Boyer, PP. 87-108< Academic Press, New 
York (1982). Another advantage of T7 RNA polymerase is 
that mutations can be introduced into the polynucleotide 
synthesis by replacing a portion of cDNA with one or more 
mutagenic oligodeoxynucleotides (polynucleotides) and 
35 transcribing the partially-mismatched template directly as 
has been previously described by Joyce et al., Njicleic 
Research , 17:711-722 (1989). Amplification systems 
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based on transcription have been described by Gingeras et 

al «/ in PCR Protocol s, A Guide to Methods and 

Applications, pp. 245-252, Academic Press, Inc., San 
Diego, CA (1990) . 
5 if the inducing agent is a DNA-dependent RNA 

polymerase and therefore incorporates ribonucleotide 
triphosphates, sufficient amounts of ATP, CTP, GTP and UTP 
are admixed to the primer extension reaction admixture and 
the resulting solution is treated as described above. 

10 The newly synthesized strand and its complementary 

nucleic acid strand form a double-stranded molecule which 
can be used in the succeeding steps of the process. 

The first and/or second primer extension reaction 
discussed above can advantageously be used to incorporate 

15 into the multimeric polypeptide a preselected epitope 
useful in immunologically detecting and/or isolating a 
multimeric polypeptide. This is accomplished by utilizing 
a first and/or second polynucleotide synthesis primer or 
expression vector to incorporate a predetermined amino 

2 0 acid residue sequence into the amino acid residue sequence 

of the receptor. 

After producing V H - and/or V L -coding DNA homologs for 
a plurality of different V H - and/or V L -coding genes within 
the repertoire, the homologs are typically amplified. 
25 While the V H and/or V L -coding DNA homologs can be amplified 
by classic techniques such as incorporation into an auto- 
nomously replicating vector, it is preferred to first 
amplify the DNA homologs by subjecting them to a polymer- 
ase chain reaction (PCR) prior to inserting them into a 

3 0 vector. In fact, in preferred strategies, the first 

and/or second primer extension reactions used to produce 
the gene library are the first and second primer extension 
reactions in a polymerase chain reaction. 

PCR is typically carried out by cycling i.e., 
35 simultaneously performing in one admixture, the above 
described first and second primer extension reactions, 
each cycle comprising polynucleotide synthesis followed by 
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denaturation of the double stranded polynucleotides 
formed. Methods and systems for amplifying a DNA homolog 
are described in U.S. Patents No. 4,683,195 and No. 
4,683,202, both to Mullis et al . Preferably, PCR is 
5 carried out by thermocycling i.e., repeatedly increasing 
and decreasing the temperature of a PCR reaction admixture 
within a temperature range whose lower limit is about 10 °C 
to about 50 °C and whose upper limit is about 90 °C to about 
100 °C. The increasing and decreasing can be continuous, 

10 but is preferably phasic with time periods of relative 
temperature stability at each of temperatures favoring 
polynucelotide synthesis, denaturation and hybridization. 

In preferred embodiments only one pair of first and 
second primers is used per amplification reaction. The 

15 amplification reaction products obtained from a plurality 
of different amplifications, each using a plurality of 
different primer pairs, are then combined. 

However, the present invention also contemplated DNA 
homolog production via co-amplification (using two pairs 

20 of primers) , and multiplex amplification (using up to 
about 8, 9 or 10 primer pairs). 

The V H - and V L -coding DNA homologs produced by PCR 
amplification are typically in double-stranded form and 
have contiguous or adjacent to each of their termini a 

25 nucleotide sequence defining an endonuclease restriction 
site. Digestion of the V H - and V L -coding DNA homologs 
having restriction sites at or near their termini with one 
or more appropriate endonucleases results in the produc- 
tion of homologs having cohesive termini of predetermined 

30 specificity. 

In preferred embodiments, the PCR process is used not 
only to amplify the v H - and/or V L -coding DNA homologs of 
the library, but also to induce mutations within the 
library and thereby provide a library having a greater 

3 5 heterogeneity. First, it should be noted that the PCR 
processes itself is inherently mutagenic due to a variety 
of factors well known in the art. Second, in addition to 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



60 

the mutation inducting variations described in the above 
referenced U.S. Patent No. 4 , 683, 195 , other mutation 
inducing PCR variations can be employed. For example, the 
PCR reaction admixture, i.e., the combined first and 
5 second primer extension reaction admixtures, can be formed 
with different amounts of one or more of the nucleotides 
to be incorporated into the extension product. Under such 
conditions, the PCR reaction proceeds to produce nucleo- 
tide substitutions within the extension product as a 

10 result of the scarcity of a particular base. Similarly, 
approximately equal molar amounts of the nucleotides can 
be incorporated into the initial PCR reaction admixture in 
an amount to efficiently perform X number of cycles, and 
then cycling the admixture through a number of cycles in 

15 excess of X, such as, for instance, 2X. Alternatively, 
mutations can be induced during the PCR reaction by incor- 
porating into the reaction admixture nucleotide deriva- 
tives such as inosine, not normally found in the nucleic 
acids of the repertoire being amplified. During subse- 

20 quent in vivo amplification, the nucleotide derivative 
will be replaced with a substitute nucleotide thereby 
inducting a point mutation. 

b. Preparing a Dicistronic DNA molecule Library 
In one embodiment, a library of dicistronic DNA 
25 molecules containing upstream and downstream cistrons 
operatively linked by a cistronic bridge can be produced 
by the following steps: 

(a) Subjecting a repertoire of first polypeptide 
genes (e.g., V H -coding genes), to PCR amplification using 

30 first outside and first inside primers, i.e., a first PCR 
primer pair, to form a first primary PCR product. 

(b) Subjecting a repertoire of second polypeptide 
genes (e.g., V L -coding genes) to PCR amplification using 
second outside and second inside pirmers, i.e., a second 

3 5 PCR primer pair, to form a second primary PCR product. 
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(c) Hybridizing the first and second primary PCR 
products to form internally (self) primed duplexes, i.e., 
duplexes having 3 ■ -hybridized and 5 1 -overhanging termini. 

(d) Subjecting the internally-primed duplexes to 
5 primer extension reaction conditions to form double 

stranded duplexes having substantially blunt, preferably 
blunt, termini and a dicistronic strand containing the 
upstream and downstream cistrons linked by a cistronic 
bridge encoded by the inside primers. By "substantially 

10 blunt" is meant having no more than about one or two 
overhanging nucleotides . (Substantially blunt double 
stranded DNA is sometimes produced by primer overextension 
by Taq polymerase, usually by the addition of one or two 
terminal adenine residues.) 

15 The V H - and V L -coding gene repertoires are comprised 

of polynucleotide coding strands, such as mRNA and/or the 
sense strand of genomic DNA. If the repertoire is in the 
form of double stranded genomic DNA, it is usually first 
denatured, typically by melting, into single strands. A 

20 repertoire is subjected to a PCR reaction as described in 
Section 3a hereinabove. 

In preferred embodiments the ratio of gene molecules 
and their respective primers is as follows: about 1 x 10 3 
V H gene molecules to about 1 x 10 8 outside V H gene molecules 

25 to about 1 x 10 8 outside V H primer molecules, about 1 x 10 3 
V H gene molecules, to about 1 x 10 7 inside V H gene primer 
molecules, about 1 x 10 3 V L gene molecules to about 1 x 10 8 
outside V L gene primer molecules, about 1 x 10 4 V L gene 
molecules to about 1 x 10 7 V L gene primer molecules. In 

3 0 more preferred embodiments, 10 4 outside V H gene primer 
molecules and 10 3 inside V H gene primer molecules are used 
for every V H gene molecule present in the PCR admixture. 
Similarly, 10 4 outside V L gene primer molecules and 10 3 V L 
gene molecule present in the PCR admixture. Thus, there 

3 5 is typically a 10 fold molar excess of outside primer to 
inside primer. 
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In the fusion PCR reaction, the gene repertoires are 
admixed with outside and inside primers, the outside 
primers being present in excess relative to the inside 
primers. The initial PCR thermocycles produce intermedin 
5 ate products having complementary termini from each of the 
first and second gene repertoires. That is, the end of 
one strand from one primary PCR product is capable of 
hybridizing with the complementary end from the other 
primary PCR product. The strands having the overlap at 

10 their 3 1 ends can act as primers for one another, i.e., 
from an internally primed duplex, and be extended by the 
polymerase to form the full length final product. The 
final product is then amplified by the set of outside 
primers, which act as a third PCR pair when the inside 

15 primers have been exhausted, to form a secondary PCR 
product. Typically the molar ratio of outside primers to 
inside primers is such that the inside primers are 
effectively exhausted within about 2 to about 12, 
preferably about 5, 6 or 7 thermocycles. 

2 0 The PCR buffer also contains the deoxyribonucleotide 

triphosphates dATP, dCTP, dGTP, and a polymerase, typic- 
ally thermostable, all in adequate amounts for primer 
extension (polynucleotide synthesis) reaction. The 
resulting solution (PCR admixture) is heated to about 90 °C 
25 - 100 °C for about 1 to 10 minutes, preferably from 1 to 4 
minutes. After this heating period the solution is 
allowed to cool to 54 °C, which is preferably for primer 
hybridization. The synthesis reaction may occur at from 
room temperature up to a temperature above which the poly- 

3 0 merase (inducing agent) no longer functions efficiently. 

Thus, for example, if DNA polymerase is used as inducing 
agent, the temperature is generally no greater than about 
40 °C. An exemplary PCR buffer comprises the following: 
50 mM KC1; 10 mM Tris-HCl; pH 8.3; 1.5 mM MgCl 2 ; 0.001% 
35 (wt/vol) gelatin, 200 /xM dATP; 200 £iM dTTP; 200 /zM dCTP; 
200 /iM dGTP; and 2.5 units Thermus acauaticus DNA poly- 
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merase I (U.S. Patent No. 4,889,818) per 100 microliters 
of buffer. 

After producing operatively linked V H - and V L -coding 
DNA hoiaologs for a plurality of different V H - and V L - 
5 coding genes within the repertoires, the dicistronic DNA 
molecules are typically further amplified. While the 
dicistronic DNA molecules can be amplified by classic 
techniques such as incorporation into an autonomously 
replicating vector, it is preferred to first amplify the 

10 molecules by subjecting them to a polymerase chain reac- 
tion (PCR) prior to inserting them into a vector. In 
fact, in preferred strategies, the first and second PCR 
reactions are performed in the same admixture that is 
subject to a multiplicity of PCR thermocycles where the 

15 outside primers are in molar excess. Preferably the 
number of PCR thermocycles is at least n+5, wherein n is 
the number of PCR thermocycles necessary to decrease by a 
factor of 10, and preferably exhaust, the number of inside 
primers by consumption in the formation of inside primer- 

2 0 primed products. 

A diverse library of dicistronic DNA molecules having 
upstream and downstream cistrons can also be produced by 
combining, in a PCR buffer, double stranded V H and V L 
repertoires, V H and V L outside primers, and an inside 
25 primer having a 3'-teminal priming portion, a cistronic 
bridge coding portion , and a 5 '-terminal inside primer- 
template (primer-coding) portion. The 3 1 -terminal priming 
portion has a nucleotide base sequence complementary to a 
portion of the primer extension product of one of the 

3 0 outside primers. The 5 '-terminal primer-template portion 

has a nucleotide base sequence homologous (identical) to 
a protion of the primer extension product of the other of 
the outside primers. That is, the linking primer has 
terminal sequences homologous to sequences in both reper- 
3 5 toires. The cistronic bridge coding portion codes for, 
either directly or through complementarily , at least one 
stop codon in the same reading frame as the upstream 
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cistron and sequences for the expression of the downstream 
cistron. 

The dicistronic DNA molecules containing operatively 
linked V H - and V L -coding DNA homologs produced by PCR 
5 amplification are typically in double-stranded in form and 
may have contiguous or adjacent to each of their termini 
a nucleotide sequence defining an endonuclease restriction 
site. Digestion of the dicistronic DNA molecules having 
restriction sites at or near their temini with one or more 
10 appropriate endonucleases results in the production of DNA 
molecules having cohesive termini of predetermined 
specificity. 

When individual PCR admixtures contain diverse gene 
repertoires the present invention produces many non- 
15 naturally occurring antibodies, i.e., combinations of V H 
and V L in a heterodimer. To take advantage of the 
mammalian immune system's capacity to select V H and V L 
combinations, the present invention also contemplates 
using fusion PCR to operatively link, and thereby recover, 
20 naturally occurring V H and V L combinations. 

In certain preferred embodiments, a fusion PCR method 
is performed on repertoires comprising a plurality of 
substantially isolated cells containing genes coding for 
a heterodimeric receptor. For example, a plurality of PCR 
25 admixtures is formed, each of which contains (i) a sample 
of substantially isolated B lymphocytes from a mammal pro- 
ducing antibody molecules against a preselected antigen, 
(ii) a PCR buffer, and (iii) either the previously 
described V H and V L PCR primer pairs or the set of outside 
3 0 V H and V L PCR primers in combination with the linking 
primer (s) , also as previously described. The plurality of 
PCR admixtures is then subjected to a multiplicity of PCR 
thermocycles as described herein. 

By "substantially isolated" is meant a sample 
35 containing less than about 100 target cells, such as B 
lymphocytes, T cells, and the like. In preferred embodi- 
ments, the plurality of PCR admixtures contain only about 
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one cell. The cells are typically obtained from an indi- 
vidual mammal whose serum contains antibody molecules 
against the preselcted antigen. The collected cells are 
typically seeded, usually at densities in the range of 0.5 
5 to 100 cells per unit volume, into a plurality of indi- 
vidual PCR vessels, such as microtiter plate wells and the 
like. Usually, the plurality of PCR admixtures is in the 
range of 800 to 1200, and preferably is about 1000, 
separate admixtures. 

10 Typically, fewer cells are needed in each PCR 

admixture where the cells are obtained from individuals 
expressing a high serum antibody titer against the pre- 
selected antigen. For example, where B lymphocytes are 
obtained from an individual having a frequency of circu- 

15 lating B cells producing the antibody molecules of 
preselected specificity of 1/3000, each of about 800 to 
12 00 individual PCR admixtures need only contain about one 
B lymphocyte to result in isolation of the desired anti- 
body. Where the circulating B cell frequency is in the 

20 range of 1/500,000, a density of about 100 cells per PCR 
admixture in each of about 800 to 1200 individual PCR 
admixtures will be needed before the process will result 
in isolation of the desired antibody. 

In preferred embodiments, the PCR process is used not 

25 only to produce a library of dicistronic DNA molecules, 
but also to induce mutations within the library or to 
create diversity from a single parental clone and thereby 
provide a library having a greater heterogeneity as noted 
in Section 3a hereinabove. 

3 0 4 . Expression 

A. Expressing the V „ and/or V L DNA Homoloas. 

The V H - and/or V L -coding DNA homologs contained within 
the library produced by the above-described method can by 
operatively linked to a vector for amplification and/or 

3 5 expression. 
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The choice of vector to which a V H - and/ or V L -coding 
DNA homolog is operatively linked depends directly, as is 
well known in the art, on the functional properties 
desired, e.g., replication or protein expression, and the 
5 host cell to be trans formed, these being limitations 
inherent in the art of constructing recombinant DNA 
molecules. In preferred embodiments, the vector utilized 
includes a procaryotic replicon i.e., a DNA sequence 
having the ability to direct autonomous replication and 

10 maintenance of the recombinant DNA molecule extra chromo- 
somally in a procaryotic host cell, such as a bacterial 
host cell, transformed therewith. Such replicons are well 
known in the art. In addition, those embodiments that 
include a procaryotic replicon also include a gene whose 

15 expression confers a selective advantage, such as drug 
resistance, to a bacterial host transformed therewith. 
Typical bacterial drug resistance genes are those that 
confer resistance to ampicillin or tetracycline. 

Those vectors that include a procaryotic replicon can 

20 also include a procaryotic promoter capable of directing 
the expression (transcription and translation) of the V H - 
and/or V L -coding homologs in a bacterial host cell, such 
as E. coli transformed therewith. A promoter is an 
expression control element formed by a DNA sequence that 

25 permits binding of RNA polymerase and transcription to 
occur. Promoter sequences compatible with bacterial hosts 
are typically provided in plasmid vectors containing 
convenience restriction sites for insertion of a DNA 
segment of the present invention. Typical of such vector 

30 plasmids are pUC8 , pUC9 , pBR322, and pBR329 available from 
BioRad Laboratories, (Richmond, CA) and pPL and pKK223 
available from Pharmacia, (Piscataway, NJ) . 

Promoters contain two highly conserved regions, one 
located about 10 bp (-10 region on Priberrow box) and the 

35 other about 35 bp (-35 region) upstream from the point at 
which transcription starts. These two regions typically 
determine promoter strength. In addition, the number of 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



67 

nucleotides atht separate the conserved sequences is 
important for efficient promoter function. For example, 
16 to 19 nucleotides typically separate the -10 and -3 5 
regions, and changes in that psacing can change the 
5 efficiency of a promoter. 

Promoters useful in this invention include Ptac 
0 1.1A, 0 1.1B and <p 10, which are recognized by T7 
polymerase. See U.S. Patent No. 4,946,786. Useful 
regulatable promoters include the E. coli lac promoter 

10 described in U.S. Patent No. 4,936,786 and the promoters 
for the temperature sensitive genes in U.S. Patent No. 
4,806,471. See also U.S. Patent No. 4,711,845. 

Expression vectors compatible with eukaryotic cells, 
preferably those compatible with vertebrate cells, can 

15 also be used. Eukaryotic cell expression vectors are well 
known in the art and are available from several commercial 
sources. Typically, such vectors are provided containing 
convenient restriction sites for insertion of the desired 
DNA homologue. Typical of such vectors are pSV L and pKSV- 

20 10 (Pharmacia), pBPV-l/PML2d (International 
Biotechnologies, Inc.), and pTDTl (ATCC, No. 31255). 

In preferred embodiments, the eukaryotic cell 
expression vectors used include a selection marker that is 
effective in an eukaryotic cell, preferably a drug resist- 

2 5 ance selection marker. A preferred drug resistance marker 

is the gene whose expression results in neomycin resist- 
ance, i.e., the neomycin phosphotransferase (neo) gene. 
Southern et al., J. Mol. Appl. Genet. , 1:327-341 (1982). 
The use of retroviral expression vectors to express 

3 0 the genes of the V H and/or V L -coding DNA homologs is also 

contemplated. As used herein, the term "retroviral 
expression vector" refers to a DNA molecule that includes 
a promoter sequences derived from the long terminal repeat 
(LTR) region of a retrovirus genome. 
35 m preferred embodiments, the expression vector is 

typically a retroviral expression vector that is prefer- 
ably replication-incompetent in eukaryotic cells. The 
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construction and use of retroviral vectors has been 
described by Sorge et al., Mol, Cel. Biol, , 41730-1737 
1984) . 

A variety of methods have been developed to opera - 
5 tively link DNA to vectors via complementary cohesive 
termini. For instance, complementary cohesive termini can 
be engineered into the V H - and/or V L -coding DNA homologs 
during the primer extension reaction by use of an appro- 
priately designed polynucleotide synthesis primer, as 

10 previously discussed. The vector, and DNA homolog if 
necessary, is cleaved with a restriction endonuclease to 
produce termini complementary to those of the DNA homolog. 
The complementary cohesive termini of the vector and the 
DNA homolog are then operatively linked (ligated) to 

15 produce a unitary double stranded DNA molecule. 

In preferred embodiments, the V H -coding and V L -coding 
DNA homologs of diverse libraries are randomly combined in 
vitro for polycistronic expression from individual 
vectors. That is, a diverse population of double stranded 

2 0 DNA expression vectors is produced wherein each vector 

expresses, under the control of a single promoter, one V H - 
coding DNA homolog and one V L -coding DNA homolog, the 
diversity of the population being the result of different 
V H - and V L -coding DNA homolog combinations. 
25 Random combination in vitro can be accomplished using 

two expression vectors distinguished from one another by 
the location on each of a restriction site common to both. 
Preferably the vectors are linear double stranded DNA, 
such as a Lambda Zap derived vector as described herein. 

3 0 In the first vector, the site is located between a promo- 

ter and a polylinker, i.e., 5 1 terminal (upstream relative 
to the direction of expression) to the polylinker by 3 1 
terminal (downstream relative to the direction of expres- 
sion) . In the second vector, the polylinker is located 
35 between a promoter and the restriction site, i.e., the 
restriction site is located 3' terminal to the polylinker, 
and polylinker is located 3* terminal to the promoter. 
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In preferred embodiments, each of the vectors defines 
a nucleotide sequence coding for a ribosome binding and a 
leader, the sequence being located between the promoter 
and the polylinker, but downstream (3' terminal) from the 
5 shared restriction site if that site is between the promo- 
ter and polylinker. Also preferred are vectors containing 
a stop codon downstream from the polylinker, but upstream 
from any shared restriction site if that site is down- 
stream from the polylinker. The first and/ or second 

10 vector can also define a nucleotide sequence coding for a 
peptide tag. The tag sequence is typically located down- 
stream from the polylinker but upstream from any stop 
codon that may be present. 

In preferred embodiments, the vectors contain 

15 selectable markers such that the presence of a portion of 
that vector, i.e. a particular lambda arm, can be selected 
for or selected against. Typical selectable markers are 
well known to those skilled in the art. Examples of such 
markers are antibiotic resistance genes, genetically 

20 selectable markers, mutation suppressors such as amber 
suppressors and the like. The selectable markers are 
typically located upstream of the promoter and/or down- 
stream of the second restriction site. In preferred 
embodiments, one selectable marker is located upstream of 

25 the promoter on the first vector containing the V H -coding 
DNA homologs. A second selectable marker is located down- 
stream of the second restriction site on the vector con- 
taining the V L -coding DNA homologs. This second selectable 
marker may be the same or different from the first as long 

30 as when the V H -coding vectors and the V L -coding vectors are 
randomly combined via the first restriction site the 
resulting vectors containing both V H and V L and both 
selectable markers can be selected. 

Typically the polylinker is a nucleotide sequence 

35 that defines one or more, preferably at least two, 
restriction sites, each unique to the vector, i.e., if it 
is on the first vector, it is not on the second vector. 
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The poly linker restriction sites are oriented to permit 
ligation of V H - or V L -coding DNA homologs into the vector 
in same reading frame as any leader, tag or stop codon 
sequence present. 
5 Random combination is accomplished by ligating V H - 

coding DNA homologs into the first vector, typically at a 
restriction site or sites within the polyl inker. 
Similarly, V L -coding DNA homologs are ligated into the 
second vector, thereby creating two diverse populations of 

10 expression vectors. It does not matter which type of DNA 
homolog, i.e., V H or V L , is ligated to which vector, but it 
is preferred, for example, that all V H -coding DNA homologs 
are ligated to either the first of second vector, and all 
of the V L -coding DNA homologs are ligated to the other of 

15 the first or second vector. The members of both popula- 
tions are then cleaved with an endonuclease at the shared 
restriction site, typically by digesting both populations 
with same enzyme. The resulting product is two diverse 
populations of restriction fragments where the members of 

20 one have cohesive termini complementary to the cohesive 
termini of the members of the other. The restriction 
fragments of the two populations are randomly ligated to 
one another, i.e., a random, interpopulation ligation is 
performed, to produce a diverse population of vectors each 

25 having a V H -coding and V L -coding DNA homolog located in the 
same reading frame and under the control of second 
vector's promoter. Of course, subsequent recombinations 
can be effected through cleavage at the shared restriction 
site, which is typically reformed upon ligation of members 

3 0 from the two populations, followed by subsequent 
religations. 

The resulting construct is then introduced into an 
appropriate host to provide amplification and/or expres- 
sion of the v H - and/or V L -coding DNA homologs, either 
35 separately or in combination. When coexpressed within the 
same organism, either on the same or the difference 
vectors, a functionally active Fv is produced. When the 
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V H and V L polypeptides are expressed in different organ- 
isms, the respective polypeptides are isolated and then 
combined in an appropriate medium to form a Fv. Cellular 
hosts into which a V H - and/or V L -coding DNA homolog- 
5 containing construct has been introduced are referred to 
herein as having been "transformed" or as "transf ormants" . 

The host cell can be either procaryotic or eucary- 
otic. Bacterial cells are preferred procaryotic host 
cells and typically are a strain of E, coli such as, for 

10 example, the E. coli strain DH5 available from Bethesda 
Research Laboratories, Inc., Bethesda, MD. Preferred 
eucaryotic host cells include yeast and mammalian cells, 
preferably vertebrate cells such as those from a mouse, 
rat, monkey or human cell line. 

15 Transformation of appropriate cell hosts with a 

recombinant DNA molecule of the present invention is 
accomplished by methods that typically depend on the type 
of vector used. With regard to transformation of procary- 
otic host cells, see, for example, Cohen et al., 

20 Proceedings National Academy of Science , USA Vol. 69, P. 
2110 (1972); and Maniatis et al., Molecul ar Cloning , — a 
Laboratory manual , Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1982). With regard to the transform- 
ation of vertebrate cells with retroviral vectors 

25 containing rDNAs, see for example, Sorge et al., Mol . 
Cell. Biol. , 4:1730-1737 (1984); Graham et al., Virol . , 
52:456 (1973); and Wigler et al . , Proceedings National 
Academy of Sciences . USA, Vol. 76, P. 1373-1376 (1979). 

b. Expressing the Dicistronic DNA Molecules 
3 0 The dicistronic DNA molecules produced by the above- 

described method can be operatively linked to a vector for 
amplification and/or expression. 

A variety of methods have been developed to oepra- 
tively link DNA to vectors via complemenetary cohesive 
3 5 termini. For instance, complementary cohesive termini can 
be engineered into the dicistronic DNA molecules during 
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the primer extension reaction by use of an appropriately 
designed polynucleotide synthesis primer, as previously 
discussed. The dicistronic DNA molecule, and vector if 
necessary, is cleaved with a restriction endonuclease to 
5 produce termini complementary to htose of the vector. The 
complementary cohesive termini of the vector and the 
dicistronic DNA molecule are then operatively linked 
(ligated) to produce a unitary double stranded DNA 
molecule. 

10 The present method produces a diverse population of 

double stranded DNA expression vectors wherein each vector 
expresses, under the control of a single promoter, one V H - 
coding DNA homolog and one V L -coding DNA homolog, the 
diversity of the populuation being the result of different 

15 V H - and V L -coding DNA homolog combination that occurs 
during the PCR reaction where both outside and both inside 
primers are present in effective amounts. Preferably the 
vectors are linear double stranded DNA, such as a Lambda 
Zap derived vector as described herein. 

2 0 In preferred embodiments, the vector defines a 

nucleotide sequence coding for a ribosome binding site and 
a leader, the sequence being located downstream from a 
promoter and upstream from a sequence ocding for apoly- 
peptide leader. In preferred embodiments, the vector 
25 contains a selectable marker such that the presence of a 
dicistronic DNA molecule of this invention inserted into 
the vector, can be selected. Typical selectable markers 
are well known to those skilled in the art. Examples of 
such markers are antibiotic resistance genes, genetically 

3 0 selectable markers, mutation suppressors such as amber 

supppressors and the like. The selectable markers are 
typically located upstream of the promoter. 

The resulting construct is then introduced into an 
appropriate host to provide amplification and/or expres- 
35 sion of the V H - and V L -coding DNA homologs. When 
coexpressed within the same organism, a functionally 
active heterodimeric receptor, such as an F v , is produced. 
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Cellular hosts into which a V H - and V L -coding DNA homolog- 
containing constructu has been introduced are referred to 
herein as having been "transformed" or as " trans f ormants" . 
The host cell can be either prokaryotic or 
5 eukaryotic. Bacterial cells are preferred prokaryotic 
host cells for library screening, and typically are a 
strain of E. coli such as, for example, the E. coli strain 
DH5 available from Bethesda Research Laboratories, Inc., 
Bethesda, MD. Preferred eukaryotic host cells include 
10 yeast and mammalian cells, preferably vertebrate cells 
such as those from a mouse, rat, monkey or human cell 
line. 

Transformation of appropriate cell hosts with a 
recombinant DNA molecule of the present invention is 

15 accomplished by methods that typically depend on the type 
of vector used. With regard to transformation of prokary- 
otic host cells, see, for example, Cohen et al., Proc. 
Natl. Acad. Sci. . USA, 69:2110 (1972); and Maniatis et 
al. , Molecular Cloning: A Laboratory Manual , Cold Spring 

20 Harbor, NY (1982) . With regard to the transformation of 
vertebrate cells with retorviral vectors containing rDNAs , 
see for example, Sorge et al., Mol. Cell. Biol. . 4:1730- 
1737 (1984); Graham et al., Virol., 52:456 (1973); and 
Wigler et al., Proc . Natl . Acad . Sci . , USA, 76:1373-1376 

25 (1979) . 

5 . Screening For Expression of V H and/or V L Polypeptides 
Successfully transformed cells, i.e., cells contain- 
ing a V H - and/or V L -coding DNA homolog or a dicistromic DNA 
molecule operatively linked to a vector, can be identified 

3 0 by any suitable well known technique for detecting the 
binding of a receptor to a ligand or the presence of a 
polynucleotide coding for the receptor, preferably its 
active site. Preferred screening assays are those where 
the binding of ligand by the receptor produces a detect- 

3 5 able signal, either directly or indirectly. Such signals 
include, for example, the production of a complex, 
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formation of a catalytic reaction product, the release or 
uptake of energy, and the like. For example, cells from 
a population subjected to transformation with a subject 
rDNA can be cloned to produce monoclonal colonies. Cells 
5 form those colonies can be harvested, lysed and their DNA 
content examined for the presence of the rDNA using a 
method such as that described Southern, J. Mol . Biol . , 
98:503 (1975) or Berent et al., Biotech. 3:208 (1985). 

In addition to directly assaying for the presence of 

10 a V H - and/or V L -coding DNA homolog or a dicistronic DNA 
molecule, successful transformation can be confirmed by 
well known immunological methods, especially when the V H 
and/or V L polypeptides produced contain a preselected 
epitope. For example, samples of cells suspected of being 

15 transformed are assayed for the presence of the 
preselected epitope using an antibody against the epitope. 

6 . V M - And/Or V L -Codina Gene Libraries 

According to one aspect, the present invention 
contemplates a gene library, preferably produced by a 

2 0 primer extension reaction or combination of primer 

extension reactions as described herein, containing at 
least about 10 3 , preferably at least about 10 4 and more 
preferably at least about 10 5 different V H - and/or V L - 
coding DNA homologs. The homologs are preferably in an 
25 isolated form, that is, substantially free of materials 
such as, for example, primer extension reaction agents 
and/or substrates, genomic DNA segments, and the like. 

In preferred embodiments, a substantial portion of 
the homologs present in the library are operatively linked 

3 0 to a vector, preferably operatively linked for expression 

to an expression vector. 

Preferably, the homologs are present in a medium 
suitable for in vitro manipulation, such as water, water 
containing buffering salts, and the like. The medium 
35 should be compatible with maintaining the activity of the 
homologs. In addition, the homologs should be present at 
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a concentration sufficient to allow transformation of a 
host cell compatible therewith at reasonable frequencies. 

It is further preferred that the homologs be present 
in compatible host cells transformed therewith. 

5 C. Expression Vectors 

The present invention also contemplates various 
expression vectors useful in performing, inter alia, the 
methods of the present invention. Each of the expression 
vectors is a novel derivative of Lambda Zap vector. 

10 1. Lambda Zap II 

Lambda Zap II is prepared by replacing the Lambda S 
gene of the vector Lambda Zap with the Lambda S gene from 
the Lambda gtlO vector, as described in Example 6. 

2 . Lambda Zap II V„ 

15 Lambda Zap II V H is prepared by inserting the 

synthetic DNA sequences illustrated in Figure 6A into the 
above-described Lambda Zap II vector. The inserted 
nucleotide sequence advantageously provides a ribosome 
binding site (Shine-Dalgarno sequence) to permit proper 

20 initiation of mRNA translation into protein, and a leader 
sequence to efficiently direct the translated protein to 
the periplasm. The preparation of Lambda Zap II V H is 
described in more detail in Example 9, and its features 
illustrated in Figures 6A and 7 . 

25 3. Lambda Zap II V L 

Lambda Zap II V L is prepared as described in Example 
12 by inserting into Lambda Zap II the synthetic DNA 
sequence illustrated in Figure 6B. Important features of 
Lambda Zap II V L are illustrated in Figure 8. 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



76 

4. Lambda Zap II V L II 

Lambda Zap II V L II is prepared as described in 
Example 11 by inserting into Lambda Zap II the synthetic 
DNA sequence illustrated in Figure 10, 

5 5. HCFLP 

HCFLP is prepared as described in Example 2 0 by 
inserting a flp sequence containing EcoRI compatible ends 
into the EcoRI site of the lambda Zap II V H vector. 

6. LCFLP 

10 LCFLP is prepared as described in Example 2 0 by 

inserting a flp sequence containing EcoRI compatible ends 
into the EcoRI site of the lambda Zap II V L vector. 

7 . Lambda ImmunoZAP H 

Lambda ImmunoZAP H is prepared by inserting the 
15 synthetic DNA sequences illustrated in Figure 25A into the 
above-described Lambda Zap II vector. The inserted 
nucleotide sequence advantageously provides a ribosome 
binding site (Shine-Dalgarno sequence) to permit proper 
initiation of mRNA translation into protein, and a leader 

2 0 sequence to effieicntly direct the translated protein to 

the periplasm. The preparation of Lambda ImmunoZAP H is 
described in more detail in Example 28, and its features 
illustrated in Figures [25A] and [26]. 

8. Modified Lambda ImmunoZAP H 

25 Modified Lambda ImmunoZAP H is prepared by inserting 

the modified synthetic DNA sequences illustrated in Figure 
8A into the above-described Lambda ZAP II vector. The 
preparation of modified Lambda ImmunoZAP H and the details 
of the modifications are described in Example 28B. Its 

3 0 features are illustrated in Figure [24A] and [24B] • 
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9 . Lambda TTtmuinoZAP L 

Lambda ImmunoZAP L is prepared as described in 
Example 29 by inserting into Lambda ZAP II the synthetic 
DNA sequence illustrated in Figure 6B. Important features 
5 of Lambda ImmunoZAP L are illustrated in Figure 27. 

The above-described vectors are compatible with EL. 
coli hosts, i.e. . they can express for secretion into the 
periplasm proteins coded for by genes to which they have 
been operatively linked for expression. 

10 Examples 

The following examples are intended to illustrate, 
but not limit, the scope of the invention. 

1. Phenotvpe Creation 

In order to obtain lambda phage clones with a range 

15 of desired phenotypes, a combinatorial library selection 
system was used to generate a diverse collection of 
clones. This approach utilized two starting populations 
of lambda phage clones which can be restriction digested, 
mixed, ligated, and packaged to form a library of clones 

20 containing DNA sequences from each of the two populations 
of parent phage. The following example outlines the 
method for rapid construction and selection of lambda 
phage clones containing properties from each of the two 
parent phage populations derived from lambda WT (cI857 
' 25 indl, Sam7) and lambda gtll (SamlOO) . 

Forty micrograms of a population of lambda phage 
derived from wild type lambda (WT) DNA (cI857 Sam7) 
(available from New England Biolabs) was partially 
digested with lambda Hindlll as determined by ethidium 

30 bromide staining on 0.8% agarose gels (Maniatis et al. , 
"Molecular Cloning," Cold Spring Harbor Laboratory 
(1982)). Forty micrograms of a second phage population 
derived from lambda gtll DNA (available from Stratagene 
Cloning Systems, San Diego, CA) was- digested to completion 

35 with Hindlll. Subsequently, this gtll DNA was digested 
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with a second enzyme BamHI in order to reduce the cloning 
efficiency of the left arm of the gtll phage (Maniatis et 
al. , supra ) . Both phage populations had been amplified 
lytically, which allowed for a relatively high degree of 
5 mutations in the resulting DNA. One microgram of the 
lambda WT DNA was ligated at the Hind III site to 1 to 4 
[ig of the lambda gtll DNA using T4 DNA ligase in a volume 
of less than 20^1/ according to Maniatis, et al . , supra . 
The ligation mix was subsequently packaged in lambda phage 

10 packaging extract, Gigapack™ (Stratagene Cloning Systems, 
San Diego, CA) , as described by the manufacturer. 

The packaged phage library contained a mixture of 
many lambda phage constructions. In order to select for 
desired constructions, phenotypic selection was used to 

15 identify those members of the library displaying vigorous 
growth on supE bacterial hosts. As described by Maniatis 
et al , , supra . dilutions of the phage library was plated 
with E. coli C600 cells (Stratagene Cloning Systems, San 
Diego, CA) to generate a lawn of E. coli with isolated 

20 lambda plaques. These isolated plaques are result of 
clonal expansion from a single lambda phage clone. Since 
C600 cells are supE, the growth vigor of the individual 
lambda phage clones could be assessed by the size of the 
lambda phage plaque on the E. coli lawn. The parental WT 

25 phage do not form plaques on E. coli C600. At least three 
classes of phage were identified and subsequently categor- 
ized as small, medium, or large plaque size. The large 
plaque size was an indication of vigorous growth on the 
phage lawn, while small plaque size indicated poor growth. 

3 0 This demonstrates selection for the phenotype of the 
S gene based on plaque size. Other pehnotypes could be 
used for selection. 

Subsequent characterization by restriction mapping 
and plating on supO (these strains contain no amber codon 

3 5 suppressing tRNAs) and supF E. coli hosts, indicated that 
at least one of the large plaque forming clones, L2, did 
not contain an amber mutation as found in the lambda WT 
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(Sam7) or lambda gtll (C5100) parent phage. One of the 
small plaque phage, S2, contained the left arm of lambda 
WT gene and the right arm of lambda gtll containing the 
SamlOO gene. This SamlOO mutation is known to grow poorly 
5 on supE hosts and is optimal on a supF strain, with no 
growth on a supO host. The remaining library of clones 
displayed several different phenotypes, dictated by the 
diversity of the two starting populations of phage. Some 
clones also exhibited phenotypes that resulted from the 

10 random assortment of two mutant DNA fragments derived from 
just one of the parent DNA molecules. THis illustrates 
the concept that the two genes that give rise to the 
populations of interest need not be on separate DNA 
molecules at the start of the method. 

15 Due to the phenotypic selection applied following the 

ligation and packaging of the phage library, the large 
diversity of these two populations of phage was not com- 
pletely analyzed. However, the range of clones identified 
with alternate S gene phenotypes demonstrated some of this 

2 0 diversity. The diversity in these two populations of 
lambda phage is believed to be derived from the low level 
of spontaneous mutations which occur through repeated 
rounds of replication required in large scale preparations 
of lambda phage. However, the spontaneous mutations 

2 5 occurring within each of these individual phage popula- 

tions could not generate a collection of lambda phage 
containing characteristics of both parent populations of 
phage. This combinatorial approach, therefore, provides 
a mechanism in which novel constructions can be generated 

3 0 that express genes from both parent phage constructions. 

2 . Polynucleotide Selection for Immunoglobulin Production 
The nucleotide sequences encoding the immunoglobulin 
protein CDR's are highly variable. However, there are 
several regions of conserved sequences that flank the V H 
3 5 domains. For instance, contain substantially conserved 
nucleotide sequences, i.e. , sequences that will hybridize 
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to the same primer sequence. Therefore, polynucleotide 
synthesis (amplification) primers that hybridize to the 
conserved sequences and incorporate restriction sites into 
the DNA homolog produced that are suitable for operatively 
5 linking the synthesized DNA fragments to a vector were 
constructed. More specifically, the DNA homologs were 
inserted into lambda Zap II vector (Stratagene Cloning 
System, San Diego, CA) at the Xhol and EcoRI sites. For 
amplification of the V H domains, the 3' primer (primer 67 
10 in Table 7) , was designed to be complementary to the mRNA 
in the J H region. In all cases, the 5 1 primers (primers 
56-65,, Table 7) were chosen to be complementary to the 
first strand cDNA in the conserved N-terminus region 
(antisense strand) . Initially amplification was performed 
15 with a mixture of 32 primers (primer 56, Table 7) that 
were degenerate at five positions. Hybridoma mRNA could 
be amplified with mixed primers, but initial attempts to 
amplify mRNA from spleen yielded variable results. 
Therefore, several alternatives to amplification using the 
20 mixed 5 1 primers were compared. 

The first alternative was to construct multiple 
unique primers, eight of which are shown in Table 7, 
corresponding to individual members of the mixed primer 
pool. The individual primers 52-64 of Table 7 were 
25 constructed by incorporating either of the two possible 
nucleotides at three of the five degenerate positions. 

The second alternative was to construct a primer 
containing inosine (primer 65, table 7) at four of the 
variable positions based on the published work of 
3 0 Takahashi, et al., Proc. Natl. Acad. Sci. (U.S.A. ) . 
82:1931-1935, (1985) and Ohtsuka et al., J. Biol. Chem. , 
260:2605-2608, (1985) . This primer has the advantage that 
it is not degenerate and, at the same time minimizes the 
negative effects of mismatches at the unconserved posi- 
35 tions as discussed by Martin et al., Nu. Acids Res . , 
13:8927 (1985) . However, it was not known if the presence 
of inosine nucleotides would result in incorporation of 
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unwanted sequences in the cloned V H regions. Therefore, 
inosine was not included at the one position that remains 
in the amplified fragments after the cleavage of the 
restriction sites. As a result, inosine was not in the 
5 cloned insert. 

Additional, V H amplification primers including the 
unique 3 1 primer were designed to be complementary to a 
portion of the first constant region domain of the gamma 
1 heavy chain mRNA (Primers 70 and 71, Table 7). These 

10 primers will produce DNA homologs containing polynucleo- 
tides coding for amino acids from the V H and the first 
constant region domains of the heavy chain. These DNA 
homologs can therefore be used to produce Fab fragments 
rather than an F v . 

15 As a control for amplification from spleen or 

hybridoma mRNA, a set of primers hybridizing to a highly 
conserved region within the constant region IgG, heavy 
chain gene were constructed. The 5' primer (primer 66, 
Table 7) is complementary to the cDNA in the C H 2 region 

20 whereas the 3 1 primer (primer 68, Table 7) is complement- 
ary to the mRNA in the C H 3 region. It is believed that no 
mismatches were present between these primers and their 
templates. 

The nucleotide sequences encoding the V L CDRs are 

2 5 highly variable. However, there are several regions of 

conserved sequences that flank the V L CDR domains including 
the J L , V L framework regions and V L leader/promoter. 
Therefore, amplification primers that hybridize to the 
conserved sequences and incorporate restriction sites that 

3 0 allowing cloning the amplified fragments into the 

pBluescript SK-vector cut with Nco I and Spel were con- 
structed. For amplification of the V L CDR domains, the 3 1 
primer (primer 69 in Table 7) , was designed to be comple- 
mentary to the mRNA in the J L regions. The 5 1 primer 
3 5 (primer 70, Table 7) was chosen to be complementary to the 
first strand cDNA in the conserved N-terminus region 
(antisense strand) . 
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A second set of amplification primers for amplifica- 
tion of the V L CDR domains the 5' primers (primers 73-80 in 
Table 8) were designed to be complementary to the first 
strand cDNA in the conserved N-terminus region. These 
5 primers also introduced a Sac I restriction endonuclease 
site to allow the FLDNA homolog to be cloned into the V L II- 
expression vector. The 3 1 V t amplification primer (primer 
81 in Table 8) was designed to be complementary to the 
mRNA in the J L regions and to introduce the Xbal restric- 
10 tion endonuclease site required to insert the V L DNA homolog 
into the V L II-expression vector (Figure 8) . 

Additional 3 1 V L amplification primers were designed 
to hybridize to the constant region of either kappa or 
lambda mRNA (primers 82 and 83 in Table 8) . These primers 
15 allow a DNA homolog to be produced containing polynucleo- 
tide sequences coding for constant region amino acids of 
either kappa or lambda chain. These primers make it 
possible to produce an Fab fragment rather than an F v . 

The primers used for amplification of kappa light 
2 0 chain sequences for construction of Fabs are shown at 
least in Table 8. Amplification with these primers was 
performed in 5 separate reactions, each containing one of 
the 5 1 primers (primers 75-78, and 84) and one of the 3 1 
primer (primer 81) has been used to construct F v fragments. 
25 The 5 1 primers contain a Sac I restriction site and the 3" 
primers contain a Xbal restriction site. 

The primers used for amplification of heavy chain Fd 
fragments for construction of Fabs are shown at least in 
Table 7. Amplification was performed in eight separate 
30 reactions, each containing one of the 5 1 primers (primers 
57-64) and one of the 3 1 primers (primer 70) . The remain- 
ing 5 1 primers that have been used for amplification in a 
single reaction are either a degenerate primer (primer 56) 
or a primer that incorporates inosine at four degenerate 
35 positions (primer 66, Table 7, and primers 89 and 90, 
Table 8). The remaining 3 1 primer (primer 86, Table 8) 
has been used to construct F v fragments . Many of the 5 1 
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primers incorporate a Xho I site, and 3' primers include 
a Spel restriction site. 

V amplification primers designed to amplify human 
light chain variable regions of both the lambda and kappa 
5 isotypes are also shown in Table 8. 

All primers and synthetic polynucleotides used herein 
and shown on Tables 7-11 were either purchased from 
Research Genetics in Huntsville, Alabama or synthesized on 
an Applied Biosystems DNA synthesizer, model 381A, usxng 
10 the manufacturer's instructions. 
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3 . Production Of A V„ Coding Repert oire Enriched In 
fttc Binding Proteins 

Fluorescein isothiocyanate (FITC) was selected as a 
ligand for receptor binding. It was further decided to 
5 enrich by immunization the immunological gene repertoire, 
i.e., V H - and V L -coding gene repertoires, for genes coding 
for anti-FITC receptors. This was accomplished by linking 
FITC to keyhole limpet hemocyanin (KLH) using the tech- 
niques described in Antibodies A Labo ratory Manual , Harlow 

10 and Lowe, eds., Cold Spring Harbor, New York, (1988). 
Briefly, 10.0 milligrams (mg) of keyhole limpet hemocyanin 
and 0.5 mg of FITC were added to 1 ml of buffer containing 
0.1 M sodium carbonate at pH 9.6 and stirred for 18 to 24 
hours at 4 degrees C (4C) . The unbound FITC was removed 

15 by gel filtration through Sephadex G-25. 

The KLH— FITC conjugate was prepared for injunction 
into mice by adding 100 \ig of the conjugate to 250 /xl of 
phosphate buffered saline (PBS) . An equal volume of com- 
plete Freund's adjuvant was added and the entire solution 

2 0 was emulsified for 5 minutes. A 129 G IX+ mouse was 

injected with 3 00 /xl of the emulsion. Injections were 
given subcutaneously at several sites using a 21 gauge 
needle. A second immunization with KLH-FITC was given two 
week later. This injection was prepared as follows: 

25 fifty fig of KLH-FITC were diluted in 250 ML of PBS and an 
equal volume of alum was admixed to the KLH-FITC solution. 
The mouse was injected intraperitoneally with 500 jul of 
the solution using a 23 gauge needle. One month later the 
mice were given a final injection of 50 _g of the KLH— 

30 FITC conjugate diluted to 200 _L in PBS. This injection 
was given intravenously in the lateral tail vein using a 
3 0 gauge needle. Five days after this final injection the 
mice were sacrificed and total cellular RNA was isolated 
from their spleens. 

3 5 Hybridoma PCP 8D11 producing an antibody immuno- 

specif ic for phosphonate ester was cultured in DMEM media 
(Gibco Laboratories, Grand Island, New York) containing 10 
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percent fetal calf serum supplemented with penicillin and 
streptomycin. About 5 x 10 s hybridoma cells were harvested 
and washed twice in phosphate buffered saline. Total 
cellular RNA was prepared from these isolated hybridoma 
5 cells. 

4. Preparation Of A V n -Codinq Gene Repertoire 

Total cellular RNA was prepared from the spleen of a 
single mouse immunized with KLH-FITC as described in 
Example 3 using the RNA preparation methods described by 
10 Chomczynski et al., Anal Biochem. , 162:156-159 (1987) 
using the manufacturer's instructions and the RNA isola- 
tion kit produced by Stratagene Cloning Systems, La Jolla, 
CA. Briefly, immediately after removing the spleen from 
the immunized mouse, the tissue was homogenized in 10 ml 
15 of a denaturing solution containing 4 . 0 M guanine isothio- 
cyanate, 0.25 M sodium citrate at pH 7.0, and 0.1 M 2- 
mercaptoethanol using a glass homogenizer. One ml of 
sodium acetate at a concentration of 2 M at pH 4.0 was 
admixed with the homogenized spleen. One ml of phenol 
20 that had been previously saturated with H 2 0 was also 
admixed to the denaturing solution containing the homo- 
genized spleen. Two ml of a chloroform: isoamyl alcohol 
(24:1 v/v) mixture was added to this homogenate. The 
homogenate was mixed vigorously for ten seconds and 
25 maintained on ice for 15 minutes. The homogenate was then 
transferred to a thick-walled 50 ml polypropylene centri- 
fuge tube (Fisher Scientific Company, Pittsburgh, PA) . 
The solution was centrifuged at 10,000 x g for 20 minutes 
at 4°C. The upper RNA-containing aqueous layer was 
3 0 transferred to a fresh 50 ml polypropylene centrifuge tube 
and mixed with an equal volume of isopropyl alcohol. This 
solution was maintained at -20 °C for at least one hour to 
precipitate the RNA. The solution containing the precipi- 
tated RNA was centrifuged at 10,000 x g for twenty minutes 
3 5 at 4°C. The pelleted total cellular RNA was collected and 
dissolved in 3 ml of the denaturing solution described 
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above. Three ml of isopropyl alcohol was added to the 
resuspended total cellular RNA and vigorously mixed. This 
solution was maintained at -20 °C for at least 1 hour to 
precipitate the RNA. The solution containing the precipi- 
5 tated RNA was centrifuged at 10,000 x g for ten minutes at 
4°C. The pelleted RNA was washed once with a solution 
containing 75% ethanol. The pelleted RNA was dried under 
vacuum for 15 minutes and then resuspended to dimethyl 
pyrocarbonate (DEPC) treated H 2 0 (DEPC-H 2 0) . 

10 Messenger RNA (mRNA) enriched for sequences contain- 

ing long poly A tracts was prepared from the total 
cellular RNA using methods described in Molecular Cloning 
A Laboratory Manual , Maniatias et al., eds. Cold Spring 
Harbor Laboratory, New York, (1982) . Briefly, one half of 

15 the total RNA isolated from a single immunized mouse 
spleen prepared as described above was resuspended in one 
ml of DEPC-H 2 0 and maintained at 65 °C for five minutes. 
One ml of 2x high salt loading buffer consisting of 100 mM 
Tris-HCl, 1M sodium chloride, 2.0 mM disodium ethylene 

20 diamine tetraacetic acid (EDTA) at pH 7.5, and 0.2% sodium 
dodecyl sulfate (SDS) was added to the resuspended RNA and 
the mixture allowed to cool to room temperature. The 
mixture was then applied to an oligo-dT (Collaborative 
Research Type 2 or Type 3) column that was previously 

25 prepared by washing the oligo-dT with a solution contain- 
ing 0.1 M sodium hydroxide and 5 mM EDTA and then 
equilibrating the column with DEPC-H 2 0. The eluate was 
collected in a sterile polypropylene tube and reapplied to 
the same column after heating the eluate for 5 minutes at 

3 0 65 °C. The oligo dT column was then washed with 2 ml of 
high salt loading buffer consisting of 50 mM Tris-HCl at 
pH 7.5, 500 mM sodium chloride, 1 mM EDTA at pH 7.5 and 
0.1% SDS. The oligo dT column was then washed with 2 ml 
of 1 X medium salt buffer consisting of 50 mM Tris-HCl at 

35 pH 7.5, 100 mM sodium chloride 1 mM EDTA and 0.1% SDS. 
The messenger RNA was eluted from the oligo dT column with 
lml of buffer consisting of 10 mM Tris-HCL at pH 7.5, 1 mM 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



90 

EDTA at pH 7.5 and 0.05% SDS. The messenger RNA was puri- 
fied by extracting this solution with phenol/ chloroform 
followed by a single extraction with 100% chloroform. The 
messenger RNA was concentrated by ethanol precipitation 
5 and resuspended in DEPC- H 2 0. 

The messenger RNA isolated by the above process 
contains a plurality of different V H coding polynucleo- 
tides, i.e., greater than about 10 4 different V H -coding 
genes . 

10 5. Preparation Of A Single V f| Coding Polynucleotide 

Polynucleotides coding for a single V H were isolated 
according to Example 4 except total cellular RNA was 
extracted from monoclonal hybridoma cells prepared in 
Example 3 . The polynucleotides isolated in this manner 

15 code for a single V H . 

6 . DNA Homolog Preparation 

In preparation for PCR amplification, mRNA prepared 
according to the above examples was used as a template for 
cDNA synthesis by a primer extension reaction. In a 

2 0 typical 50 ul transcription reaction, 5-10 ug of spleen or 

hybridoma mRNA in water was first hybridized (annealed) 
with 500 ng (50.0 pmol) of the 3 r V H primer (primer 67, 
Table 7), at 65 °C for five minutes. Subsequently, the 
mixture was adjusted to 1.5 mM dATP, dCTP, dGTP and dTTP, 
25 40 mM Tris-HCl at pH 8.0, 8 mM MgCl 2 , 50 mM NaCl, and 2 mM 
spermidine. Moloney-Murine Leukemia virus Reverse 

transcriptase (Stratagene Cloning Systems), 26 units, was 
added and the solution was maintained for 1 hours at 37 °C. 
PCR amplification was performed in a 100 ul reaction 

3 0 containing the products of the reverse transcription 

reaction (approximately 5 ug of the cDNA/RNA hybrid) , 300 
ng of 3 1 V H primer (primer 67 of Table 7), 300 ng each of 
the 5 1 V H primers (primer 57-65 of Table 7) 200 mM of a 
mixture of dNTP's, 50 mM KCl , 10 mM Tris-HCl pH 8.3, 15 mM 
35 MgCl 2 , 0.1% gelatin and 2 units of Tag DNA polymerase. The 
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reaction mixture was overlaid with mineral oil and sub- 
jected to 40 cycles of amplification. Each amplification 
cycle involved denaturation at 92 °C for 1 minute, anneal- 
ing at 52 °C for 2 minutes and polynucleotide synthesis by 
5 Primer extension (elongation) at 72 °C for 1.5 minutes* 
The amplified V H -coding DNA homolog containing samples were 
extracted twice with phenol/ chlorof orm, once with chloro- 
form, ethanol precipitated and were stored at -70°C in 10 
mM Tris-HCl, (pH, 7.5) and 1 mM EDTA. 

10 Using unique 5« primers (57-64, Table 7), efficient 

V H -coding DNA homolog synthesis and amplification from the 
spleen mRNA was achieved as shown in Figure 3, lanes R17- 
R24. The amplified cDNA (V H -coding DNA homolog) is seen as 
a major band of the expected size (3 60 bp) . The intensi- 

15 ties of the amplified V H -coding polynucleotide fragment in 
each reaction appear to be similar, indicating that all of 
these primers are about equally efficient in initiating 
amplification. The yield and quality of the amplification 
with these primers was reproducible. 

2 0 The primer containing inosine also synthesized ampli- 

fied V H -coding DNA homologs from spleen mRNA reproducibly , 
leading to the production of the expected sized fragment, 
of an intensity similar to that of the other amplified 
cDNAs (Figure 4, lane R16) . This result indicated that 
25 the presence of inosine also permits efficient DNA homolog 
synthesis and amplification. Clearly indicating how 
useful such primers are in generating a plurality of VH- 
coding DNA homologs. Amplification products obtained from 
the constant region primers (primers 66 and 68, Table 7) 

3 0 were more intense indicating that amplification was more 

efficient, possibly because of a higher degree of homology 
between the template and primers (Figure 4, Lang R9) . 
Based on these results, a V H -coding gene library was 
constructed from the products of eight amplifications, 
35 each performed with a different 5 1 primer. Equal portions 
of the products from each primer extension reaction were 
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mixed and the mixed product was then used to generate a 
library of V H -coding DNA homolog-containing vectors. 

DNA homologs of the V L were prepared from the purified 
mRNA prepared as described above. In preparation for PCR 
5 amplification, mRNA prepared according to the above 
examples was used as a template for cDNA synthesis. In a 
typical 50 ul transcription reaction, 5-10 ug of spleen or 
hybridoma mRNA in water was first annealed with 300 ng 
(50.0 pmol) of the 3 1 V L primer (primer 69 , Table 7), at 

10 65 °C for five minutes. Subsequently, the mixture was 
adjusted to 1.5 mM dATP, dCTP, dGTP, and dTTP, 40 mM Tris- 
HCL at pH 8.0, 8 mM MgCl 2 , 50 mM NaCl, and 2 mM spermidine. 
Moloney-Murine Leukemia virus reverse transcriptase 
(Stratagene Cloning Systems) , 26 units, was added and the 

15 solution was maintained for 1 hour at 37 °C. The PCR 
amplification was performed in a 100 ul reaction contain- 
ing approximately 5 ug of the cDNA/RNA hybrid produced as 
described above, 300 ng of the 3 1 V L primer (primer 69 of 
Table 7), 300 ng of the 5 1 V L primer (primer 70 of Table 

20 7), 200 mM of a mixture of dNTP's, 50 mM KC1, 10 mM Tris- 
HC1 pH 8.3, 15 mM MgCl 2 , 0.1% gelatin and 2 units of Taq 
DNA polymerase. The reaction mixture was overlaid with 
mineral oil and subjected to 40 cycles of amplification. 
Each amplification cycle involved denaturation at 92 °C for 

25 1 minute, annealing at 52 °C for 2 minutes and elongation 
at 72 "C for 1.5 minutes. The amplified samples were 
extracted twice with phenol/chloroform, once with chloro- 
form, ethanol precipitated and were stored at 70 °C in 10 
mM Tris-HCl at pH 7 . 5 and 1 mM EDTA. 

3 0 7 . Inserting DNA Homologs Into Vectors 

In preparation for cloning a library enriched in V H 
sequences, PCR amplified products (2.5 mg/30 ul of 150 mM 
NaCl, 8 mM Tris-HCl (pH 7.5), 6 MM MgSo 4 , 1 mM DTT, 200 
mg/ml bovine serum albumin (BSA) at 37 °C were digested 

3 5 with restriction enzymes Xho I (125 units) and EcoR I (10 
U) and purified on a 1% agarose gel. In cloning experi- 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



93 

merits which required a mixture of the products of the 
amplification reactions, equal volumes (50 ul, 1-10 ug 
concentration) of each reaction mixture were combined 
after amplification but before restriction digestion. 
5 After gel electrophoresis of the digested PCR amplified 
spleen mRNA, the region of the gel containing DNA frag- 
ments of approximately 350 bps was excised , electroeluted 
into a dialysis membrane, ethanol precipitated and resus- 
pended in 10 mM Tris-HCl pH 7.5 and 1 mM EDTA to a final 

10 concentration of 10 ng/ul. Equimolar amounts of the 
insert were then ligated overnight at 5°C to 1 ug of 
Lambda Zap™ II vector (Stratagene Cloning Systems, La 
Jolla, CA) previously cut by EcoR I and Xho I. A portion 
of the ligation mixture (1 ul) was packaged for 2 hours at 

15 room temperature using Gigapack Gold packaging extract 
(Stratagene Cloning Systems, La Jolla, CA) , and the pack- 
aged material was plated on ILl-blue host cells. The 
Library was determined to consist of 2 x 10 7 V H homologs 
with less than 3 0% non-recombinant background. 

2 0 The vector used above, Lambda Zap II is a derivative 

of the original Lambda Zap (ATCC # 40,298) that maintains 
all of the characteristics of the original Lambda Zap 
including 6 unique cloning sites, fusion protein expres- 
sion, and the ability to rapidly excise the insert in the 
25 form of a phagemid (Bluescript SK-) , but lacks the SAM 100 
mutation, allowing growth on many Non-Sup F strains, 
including XLl-Blue. The Lambda Zap II was constructed as 
described in Short et al., Nucleic Acids Res. , 16:7583- 
7600, (1988), by replacing the Lambda S gene contained in 

3 0 a 4 254 base pair (bp) DNA fragment produced by digesting 

Lambda Zap with the restriction enzyme Ncol. This 4254 bp 
DNA fragment was replaced with the 4254 bp DNA fragment 
containing the Lambda S gene isolated from Lambda gtlO 
(ATCC # 40,179) after digesting the vector with the 
35 restriction enzyme Ncol. The 4254 bp DNA fragment 
isolated from lambda gtlO was ligated into the original 
Lambda Zap vector using T4 DNA ligase and standard proto- 
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cols for such procedures described in Current: Protocols in 
Molecular Biology , Ausubel et al., eds., John Wiley and 
Sons, New York, (1987). 

In preparation of cloning a library enriched in V L 
5 sequences, 2 ug of PGR amplified products (2.5 mg/30 ul of 
150 mM NaCl, 8 mM Tris-HCl (pH 7.5) , 6 mM MgSo 4 , 1 mM DTT, 
200 mg/ml BSA) were digested with restriction enzymes Nco 
I (3 0 unites) and Spe I (45 units) at 37 °C for 2 hours. 
The digested PCR amplified products were purified on 1% 

10 agarose gel using standard electroelution technique des- 
cribed in Molecular Cloning A Laboratory Manual , Maniatis 
et al., eds., Cold Spring Harbor, New York, (1982). 
Briefly, after gel electroelution of the digested PCR 
amplified product the region of the gel containing the V L - 

15 coding DNA fragment of the appropriate size was excised, 
electroelution into a dialysis membrane, ethanol precipi- 
tated and resuspended at a final concentration of 10 ng 
per ml in a solution containing 10 mM Tris-HCL at pH 7.5 
and 1 mM EDTA. 

2 0 An equal molar amount of DNA representing a plurality 

of different V L -coding DNA homologs was ligated to a 
pBluescript SK- phagemid vector that had been previously 
cut with Nco I and Spe I. A portion of the ligation mix- 
ture was transformed using the manufacturer's instructions 
25 into Epicuian Coli XLl-Blue competent cells (Stratagene 
Cloning Systems, La Jolla, CA) . The transformant library 
was determined to consist of 1.2 x 10 3 colony forming 
units/ug of V L homologs with less than 3% non-recombinant 
background . 

30 8. Sequencing of Plasmids From the V |r Codina cDNA Library 
To analyze the Lambda Zap II phage clones, the clones 
were excised from Lambda Zap into plasmids according to 
the manufacture's instructions (Stratagene Cloning System, 
La Jolla, CA) . Briefly, phage plaques were cored from the 

3 5 agar plates and transferred to sterile microfuge tubes 

containing 500 ul a buffer containing 50 mM Tris-HCL at pH 
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7.5, 100 mM NaCl, 10 mM MgS0 4 , and 0.01% gelatin and 20 uL 
of Chloroform. 

For excisions, 200 ul of the phage stock, 200 ul of 
XLl-Blue cells (A 600 = 1.00) and 1 ul of R4 08 helper phage 
5 (1 x 10 11 pfu/ml) were incubated at 37 °C for 15 minutes. 
The excised plasmids were infected into XLl-Blue cells and 
plated onto LB plates containing ampicillin. Double 
stranded DNA was prepared from the phagemid containing 
cells according to the methods described by Holmes et al., 

10 Anal. Biochem. ■ 114:193, (1981). Clones were first 
screened for DNA inserts by restriction digests with 
either Pvu II or Bgl I and clones containing the putative 
V u insert were sequenced using reverse transcriptase 
according to the general method described by Sanger et 

15 al., Proc. Natl. Acad, Sci., USA , 74:5463-5467, (1977) and 
the specific modifications of this method provided in the 
manufacturer's instruction in the AMV reverse transcript- 
ase 35 S-dATP sequencing kit from Stratagene Cloning 
Systems, La Jolla, CA. 

2 0 9 . Characterization Of T he Cloned V tl Repertoire 

The amplified products which had been digested with 
Xho I and EcoR I and cloned into Lambda Zap, resulted in 
a cDNA library with 9.0 x 10 5 pfu's. In order to confirm 
that the library consisted of a diverse population of V H - 
25 coding DNA homologs, the N-terminal 12 0 bases of 18 
clones, selected at random from the library, were excised 
and sequenced (Figure 5) . To determine if the clones were 
of V H gene origin, the cloned sequences were compared with 
known V H sequences and V L sequences. The clones exhibited 

3 0 from 8 0 to 90% homology with sequences of known heavy 

chain origin and little homology with sequences of light 
chain origin when compared with the sequences available in 
Sequences of Proteins of Immunologic al Interest by Kabot 
et al., 4th ed., U.S. Dept. of Health and Human Sciences, 
3 5 (1987) . This demonstrated that the library was enriched 
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for the desired V H sequence in preference to other 
sequences, such as light chain sequences. 

The diversity of the population was assessed by 
classifying the sequenced clones into predefined subgroups 

5 (Figure 5) . Mouse V H sequences are classified into eleven 
subgroups (Figure 5) . Mouse V H sequences are classified 
into eleven subgroups [I (A,B,), II (A f B,C) , III (A,B,C,D( 
V (A, B) ] based on framework amino acid sequences described 
in Sequences of proteins of Immunological Interest by 

10 Kabot et al. , 4th ed. , U.S. Dept. of Health and Human 
Sciences, (1987)? Dildrop, Immunology Today . 5:84, (1984); 
and Brodeur et al., Eur. J. Immunol. , 14:922, (1984). 
Classification of the sequenced clones demonstrated that 
the cDNA library contained V H sequences of at least 7 

15 different subgroups. Further, a pairwise comparison of 
the homology between the sequenced clones showed that no 
two sequences were identical at all positions, suggesting 
that the population is diverse to the extent that it is 
possible to characterize by sequence analysis. 

20 Six of the clones (L 36-50, Figure 5) belong to the 

subclass III B and had very similar nucleotide sequences. 
This may reflect a preponderance of mRNA derived from one 
or several related variable genes in stimulated spleen, 
but the data does not permit ruling out the possibility of 

25 a bias in the amplification process. 

10. V u -Expression Vector Construction 

The main criterion used in choosing a vector system 
was the necessity of generating the largest number of Fab 
fragments which could be screened directly. Bacteriophage 

3 0 lambda was selected as the expression vector for three 
reasons. First, in vitro packaging of phage DNA is a 
highly efficient method of reintroducing DNA into host 
cells. Second, it is possible to detect protein expres- 
sion at the level of single phage plaques. Finally, the 

3 5 screening of phage libraries typically involve less 
difficulty with nonspecific binding. An alternative, 
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plasmid cloning vectors, are only advantageous in the 
analysis of clones after they have been identified. This 
advantage is not lost in the present system because of the 
use of lambda Zap, thereby permitting a plasmid containing 
5 the heavy chain, light chain, or Fab expressing inserts 
to be excised. 

To express the plurality of V H -coding DNA homologs in 
an E. coli host cell, a vector was constructed that placed 
the V H -coding DNA homologs in the proper reading frame, 

10 provided a ribosome binding site as described by Shine et 
al., Nature, 254:34, 1975, provided a leader sequence 
directing the expressed protein to the periplasmic space, 
provided a polynucleotide sequence that coded for a known 
epitope (epitope tag) and also provided a polynucleotide 

15 that coded for a spacer protein between the V H -coding DNA 
homolog and the polynucleotide coding for the epitope tag. 
A synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by designing 
single stranded polynucleotide segments of 2 0-4 0 bases 

20 that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 6. The 
individual single-stranded polynucleotides (N1-N12) are 
shown in Table 9 . 

Polynucleotides N2, N3 , N9-4 1 , Nil, N10-5 1 , N6, N7 

25 and N8 were kinased by adding 1 ul of each polynucleotide 
(0.1 ug/ul) and 2 0 units of T4 polynucleotide kinase to a 
solution containing 70 mM Tris-HCl at pH 7 . 6 , 10 mM MgCl 2 
5 mM DTT , 10 mM 2-mercaptoethanol (2ME) , 500 micrograms 
per ml of BSA. The solution was maintained at 37°C for 30 

3 0 minutes and the reaction stopped by maintaining the solu- 
tion at 65 °C for 10 minutes. The two end polynucleotides 
2 0 ng of polynucleotides Nl and polynucleotides N12 , were 
added to the above kinasing reaction solution together 
with 1/10 volume of a solution containing 20.0 mM Tris- 

35 HC1 at pH 7.4, 2 . 0 inM MgCl 2 and 50.0 mM NaCl. This solu- 
tion was heated to 70 °C for 5 minutes and allowed to cool 
to room temperature, approximately 25 °C, over 1.5 hours in 
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a 500 ml beaker of water. During this time period all 10 
polynucleotides annealed to form the double stranded 
synthetic DNA insert shown in Figure 6A. The individual 
polynucleotides were covalently linked to each other to 
5 stabilize the synthetic DNA insert by adding 40 ul of the 
above reaction to a solution containing 50 mM Tris-HCl at 
pH 7.5, 7 mM MgCl 2 , 1 mM DTT, 1 mM adenosine triphosphate 
(ATP) and 10 units of T4 DNA ligase. This solution was 
maintained at 37 °C for 3 0 minutes and then the T4 DNA 

10 ligase was inactivated by maintaining the solution at 65 °C 
for 10 minutes. The end polynucleotides were kinased by 
mixing 52 ul of the above reaction, 4 ul of a solution 
containing 10 mM ATP and 5 units of T4 polynucleotide 
kinase. This solution was maintained at 37 °C for 3 0 

15 minutes. The completed synthetic DNA insert was ligated 
directly into a lambda Zap II vector that had been previ- 
ously digested with the restriction enzymes Not I and Xho 
I. The ligation mixture was packaged according to the 
manufacture's instructions using Gigapack II Gold packing 

2 0 extract available from Stratagene Cloning Systems, La 
Jolla, CA. The packaged ligation mixture was plated on 
XL1 blue cells (Stratagene Cloning Systems, San Diego, 
CA) . Individual lambda Zap II plaques were cored and the 
inserts excised according to the in vivo excision protocol 

25 provided by the manufacturer, Stratagene Cloning Systems, 
La Jolla, CA. This in vivo excision protocol moves the 
cloned insert from the lambda Zap II vector into a plasmid 
vector to allow easy manipulation and sequencing. The 
accuracy of the above cloning steps was confirmed by 

30 sequencing the insert using the Sanger dideoxide method 
described in by Sanger et al., Proc. Natl. Acad. Sci USA , 
74:5463-5467, (1977) and using the manufacture's instruc- 
tion in the AMV Reverse Transcriptase 35 S-ATP sequencing 
kit from Stratagene Cloning Systems, La Jolla, CA. The 

35 sequence of the resulting V H expression vector is shown in 
Figure 6 A and Figure 7 . 
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Table 9 






(91) 


Nl) 


5 


' GGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3 1 


(92) 


N2) 


5 


» AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3 ' 


(93) 


N3) 


5 


• GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3 ' 


5 (94) 


N4) 


5 


• AGGTGAAACTGCTCGAGAATTCTAGACTAGGTTAATAG 3 • 


(95) 


N5) 


5 


• TCGACTATTAACTAGTCTAGAATTCTCGAG 3 ' 


(96) 


N6) 


5 


• CAGTTTCACCTGGGCCATGGCTGGTTGGG 3 • 


(97) 


N7) 


5' 


CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3 ' 


(98) 


N8) 


5 


• GTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGC 3 1 


10 (99) 


N9-4) 


5' 


AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 3 1 


(100) 


Nil) 




5 ' GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3 ' 


(101) 


N12) 




5 ' TCGACGAATTCTATTAAGAACCGTAGTC 3 • 



(102) N10-5) 5 CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3' 

11. V L Expression Vector Co nstruction 

15 To express the plurality of V L coding polynucleotides 

in an E. coli host cell, a vector was constructed that 
placed the V L coding polynucleotide in the proper reading 
frame, provided a ribosome binding site as described by 
Shine et al., Nature , 254:34, (1975), provided a leader 

20 sequence directing the expressed protein to the piro- 
plasmic space and also provided a polynucleotide that 
coded for a spacer protein between the V L polynucleotide 
and the polynucleotide coding for the epitope tag. A 
synthetic DNA sequence containing all of the above poly- 

25 nucleotides and features was constructed by designing 
single stranded polynucleotide segments of 2 0-40 bases 
that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 6B. The 
individual single-stranded polynucleotides (N1-N8) are 

3 0 shown in Table 9. 

Polynucleotides N2 , N3 , N4 , N6, N7 and N8 were 
kinased by adding 1 ul of each polynucleotide and 20 units 
of T4 polynucleotide kinase to a solution containing 7 0 mM 
Tris-HCL at pH 7.6, 10 mM MgCl 2 , 5 mM DTT, 10 mM 2ME, 500 

3 5 micrograms per ml of BSA. The solution was maintained at 
37 °C for 30 minutes and the reaction stopped by maintain- 
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ing the solution at 65 °C for 10 minutes. The two end 
polynucleotides 20 ng of polynucleotides Nl and poly- 
nucleotides N5 were added to the above kinasing reaction 
solution together with 1/10 volume of a solution contain- 
5 ing 20.0 mM Tris-HCL at pH 7.4, 2.0 mM MgCl 2 and 50.0 mM 
NaCl. This solution was heated to 70 °C for 5 minutes and 
allowed to cool to room temperature, approximately 25 °C, 
over 1.5 hours in a 500 ml beaker of water. During this 
time period all the polynucleotides annealed to form the 

10 double stranded synthetic DNA insert. The individual 
polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert with adding 4 0 ul of 
the above reaction to a solution containing 50 ul Tris- 
HCL at pH 7.5, 7 mM MgCl 2 , 1 mM DTT, 1 mM ATP and 10 units 

15 to T4 DNA ligase. This solution was maintained at 37 °C 
for 3 0 minutes and then the T4 DNA ligase was inactivated 
by maintaining the solution at 65 °C for 10 minutes. The 
end polynucleotides were kinased by mixing 52 ul of the 
above reaction, 4 ul of a solution recontaining 10 mM ATP 

20 and 5 units of T4 polynucleotide kinase. This solution 
was maintained at 37 °C for 30 minutes and then the T4 
polynucleotide kinase was inactivated by maintaining the 
solution at 65 °C for 10 minutes. The completed synthetic 
DNA insert was ligated directly into a lambda Zap II 

25 vector that had been previously digested with the restric- 
tion enzymes Not I and Xho I. The ligation mixture was 
packaged according to the manufacture's instructions using 
Gigapack II Gold packing extract available from Stratagene 
Cloning Systems, La Jolla, CA. The packaged ligation 

3 0 mixture was plated on XLl-Blue cells (Stratagene Cloning 
Systems, La Jolla, CA) . Individual lambda Zap II plagues 
were cored and the inserts excised according to the in 
vivo excision protocol provided by the manufacturer, 
Stratagene Cloning Systems, La Jolla, CA and described in 
35 Short et al., Nucleic Acids Res. , 16:7583-7600 (1988). 
This in vivo excision protocol moves the cloned insert 
from the lambda Zap II vector into a phagemid vector to 
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allow easy manipulation and sequencing and also produces 
the phagemid version of the V L expression vectors. The 
accuracy of the above cloning steps was confirmed by 
sequencing the insert using the Sanger dideoxide method 
5 described by Sanger et al., Proc. Natl. Acad. Aci. USA, 
2Ai 5463-5467, (1977) and using the manufacturer's instruc- 
tions in the AMV reverse transcriptase 35 S-dATP sequencing 
kit from Stratagene Cloning Systems, La Jolla, CA. The 
sequence of the resulting V L expression vector is shown in 

10 Figure 6 and Figure 8. 

The V L expression vector used to construct the V L 
library was the phagemid produced to allow the DNA of the 
V L expression vector to be determined. The phagemid was 
produced, as detailed above, by the in vivo excision 

15 process from the Lambda Zap V L expression vector (Figure 
8) . The phagemid version of this vector was used because 
the Nco I restriction enzyme site is unique in this 
version and thus could be used to operatively linked the 
V L DNA homologs into the expression vector. 

20 12. v L II-Expression Vector Construction 

To express the plurality of V L -coding DNA homologs in 
an E. coli host cell, a vector was constructed that placed 
the V L -coding DNA homologs in the proper reading frame, 
provided a ribosome binding site as described by Shine et 

25 al., Nature , 254:34, 1975, provided the Pel B gene leader 
sequence that has been previously used to successfully 
secrete Fab fragments in E. coli by Lei et al., J . Bac. , 
169:4379 (1987) and Better et al., Science , 240:1041 
(1988) , and also provided a polynucleotide containing a 

3 0 restriction endonuclease site for cloning. A synthetic 
DNA sequence containing all of the above polynucleotides 
and features was constructed by designing single stranded 
polynucleotide segments of 20-60 bases that would hybrid- 
ize to each other and form the double stranded synthetic 

3 5 DNA sequence shown in Figure 10. The sequence of each 
individual single-stranded polynucleotides (01-08) within 
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the double stranded synthetic DNA sequence is shown in 
Table 10. 

Polynucleotides 02, 03, 04, 05, 06 and 07 were 
kinased by adding 1 ul (0-1 ug/ul) of each polynucleotide 
5 and 20 units of T4 polynucleotide kinase to a solution 
containing 70 mM Tris-HCL at pH 7.6, 10 mM magnesium 
chloride (MgCl) , 5 mM dithiothreitol (DTT) , 10 mM 2- 
mercaptoethanol (2ME) , 500 micrograms per ml of bovine 
serum albumin. The solution was maintained at 37 °C for 30 

10 minutes and the reaction stopped by maintaining the 
solution at 65 °C for 10 minutes. The 20 ng each of the 
two end polynucleotides, 01 and 08, were added to the 
above kinasing reaction solution together with 1/10 volume 
of a solution containing 20.0 mM Tris-HCl at pH 7.4, 2.0 

15 mM MgCl and 15.0 mM sodium chloride (NaCl) . This solution 
was heated to 70 °C for 5 minutes and allowed to cool to 
room temperature, approximately 25 °C, over 1.5 hours in a 
500 ml beaker of water. During this time period all 8 
polynucleotides annealed to form the double stranded 

2 0 synthetic DNA insert shown in Figure 9. The individual 

polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert by adding 40 ul of the 
above reaction to a solution containing 50 ml Tris-HCl at 
pH 7.5, 7 ml MgCl, 1 mm DTT, 1 mm ATP and 10 units of T4 
25 DNA ligase. This solution was maintained at 37 °C for 30 
minutes and then the T4 DNA ligase was inactivated by 
maintaining the solution at 65°C for 10 minutes. The end 
polynucleotides were kinased by mixing 52 ul of the above 
reaction, 4 ul of a solution containing 10 mM ATP and 5 

3 0 units of T4 polynucleotide kinase. This solution was 

maintained at 37 °C for 3 0 minutes and then the T4 poly- 
nucleotide kinase was inactivated by maintaining the 
solution at 65 e C for 10 minutes. The completed synthetic 
DNA insert was ligated directly into a lambda Zap II 
35 vector that had been previously digested with the restric- 
tion enzymes Not I and Xho I. The ligation mixture was 
packaged according to the manufacturer's instructions 
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using Gigapack II Gold packing extract available from 
Stratagene Cloning Systems, La Jolla, CA. The packaged 
ligation mixture was plated on XL1 blue cells (Stratagene 
Cloning Systems, San Diego, CA) . Individual lambda Zap II 
5 plaques were cored and the inserts excised according to 
the in vivo excision protocol provided by the manufac- 
turer, Stratagene Cloning Systems, La Jolla, CA. This in 
vivo excision protocol moves the cloned insert from the 
lambda Zap II vector into a plasmid vector to allow easy 

10 manipulation and sequencing. The accuracy of the above 
cloning steps was confirmed by sequencing the insert using 
the manufacturer's instructions in the AMV Reverse 
Transcriptase 35 S-dATP sequencing kit from Stratagene 
Cloning Systems, La Jolla, CA. The sequence of the 

15 resulting V L II-expression vector is shown in Figure 9 and 
Figure 11. 

Table 10 

(102) 01) 5* TGAATTCTAAACTAGTCGCCAAGGAGACAGTCAT 3' 

(103) 02) 5' AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3' 
20 (104) 03) 5' GTTATTACTCGCTGCCCAACCAGCCATGGCC 3' 

(105) 04) 5' GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 3' 

(106) 05) 5« 

GTATTTCATTATGACTGTCTCCTTGGCGACTAGTTTAGAATTCAAGCT 
3 ' 

25 (107) 06) 5' CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3« 

(108) 07) 5' TGACGAGCTCGGCCATGGCTGGTTGGG 3' 

(109) 08) 5' TCGACGGCCGCTTAACTCTAGAAC 3' 

13 . Vh ± V L T.ibrarv Construction 

To prepare an expression library enriched in V H 

3 0 sequences, DNA homologs enriched in V H sequences were 
prepared according to Example 7 using the same set of 5' 
primers but with primer 62A (Table 7) as the 3' primer. 
These .homologs were then digested with the restriction 
enzymes Xho I and Spe I and purified on a 1% agarose gel 

35 using the standard electroelution technique described in 
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Molecular Cloni ng A Laboratory Manual , Maniatis et al., 
eds., Cold Spring Harbor, New York, (1982). These 
prepared V H DNA homologs were then directly inserted into 
the V H expression vector that had been previously digested 
5 with Xho I and Spe I. 

The ligation mixture containing the V H DNA homologs 
were packaged according to the manufacturers specifica- 
tions using Gigapack Gold II Packing Extract (Stratagene 
Cloning Systems, La Jolla, CA) . The expression libraries 

10 were then ready to be plated on XL-1 Blue cells. 

To prepare a library enriched in V L sequences, PCR 
amplified products enriched in V L sequences were prepared 
according to Example 7. The V L DNA homologs were digested 
with restriction enzymes Nco I and Spe I. The digested V L 

15 DNA homologs were purified on a 1% agarose gel using 
standard electrelusion techniques described in Molecular 
Cloning A Labo ratory Manual , Maniatis et al., eds. , Cold 
Spring Harbor, NY (1982) . The prepared V L DNA homologs 
were directly inserted into the V L expression vector that 

2 0 had been previously digested with the restriction enzymes 

Nco I and Spe I. The ligation mixture containing the V L 
DNA homologs were transformed into XL-1 blue competent 
cells using the manufacturer's instructions (Stratagene 
Cloning Systems, La Jolla, CA) . 

25 14 . Inserting V L Coding DNA H omologs Into V L Expression 
Vector 

In preparation for cloning a library enriched in V L 
sequences, PCR amplified products (2.5 ug/30 ul of 150 mM 
NaCl, 8 mM Tris-HCl (pH 7.5), 6 mM MgCl 4 , 1 mM DTT, 200 

3 0 ug/ml BSA at 37 °C were digested with restriction enzymes 

Sac I (125 units) and Xba I (125 units) and purified on a 
1% agarose gel. In cloning experiments which required a 
mixture of the products of the amplification reactions, 
equal volumes (50 ul, 1-10 ug concentration) of each 
35 reaction mixture were combined after amplification but 
before restriction digestion. After gel electrophoresis 
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of the digested PCR amplified spleen mRNA, the region of 
the gel containing DNA fragments of approximately 3 50 bps 
was excised, electroeluted into a dialysis membrane, 
ethanol precipitated and resuspended in a TE solution 
5 containing 10 mM Tris-HCl pH 7.5 and 1 mM EDTA to a final 
concentration of 50 ng/ul. 

The V L II-expression DNA vector was prepared for 
cloning by admixing 100 ug of this DNA to a solution 
containing 250 units each of the restriction endonucleases 

10 Sac 1 and Xba 1 (both from Boehringer Mannheim, 
Indianapolis, IN) and a buffer recommended by the manu- 
facturer. This solution was maintained at 37°C for 1.5 
hours. The solution was heated at 65 °C for 15 minutes to 
inactivate the restriction endonucleases. The solution 

15 was chilled to 30 "C and 25 units of heat-killable (HK) 
phosphatase (Epicenter, Madison, WI) and CaCl 2 were admixed 
to it according to the manufacturer's specifications. 
This solution was maintained at 30 °C for 1 hour. The DNA 
was purified by extracting the solution with a mixture of 

20 phenol and chloroform followed by ethanol precipitation. 
The V L II expression vector was now ready for ligation to 
the V L DNA homologs prepared in the above examples. 

DNA homolog enriched in V L sequences were prepared 
according to Example 6 but using a 5' light chain primer 

25 and 3 f light chain primer shown in Table 9. Individual 
amplification reactions were carried out using each 5 ' 
light chain primer in combination with the 3 ' light chain 
primer. These separate V L homolog-containing reaction 
mixtures were mixed and digested with the restriction 

3 0 endonucleases Sac 1 and Xba 1 according to Example 7. The 
V L homologs were purified on a 1% agarose gel using the 
standard electroelution technique described in Molecular 
Cloning A Laboratory Manual . Maniatis et al., eds., Cold 
Spring Harbor, New York, (1982). These prepared V L DNA 

35 homologs were then directly inserted into the Sac 1 - Xba 
cleaved V L II-expression vector that was prepared above by 
ligating 3 moles of V L DNA homolog inserts with each mole 
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of the V L lI-expression vector overnight at 5 8 C. 3.0 x 10 5 
plaque forming units were obtained after packaging the DNA 
with Gigapack II Bold (Stratagene Cloning Systems, La 
Jolla, CA) and 50% were recombinants. 

5 15 . Randomly Combining V n and V L DNA Homoloas on the Same 
Expression Vector 

The V L II-expression library prepared in Example 13 was 
amplified and 500 ug of V L II-expression library phage DNA 
prepared from the amplified phage stock using the proce- 

10 dures described in Molecular Cloning; A Laboratory 
Manual , Maniatis et al., eds., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1982) , 50 ug of this 
V L II-expression library phage DNA was maintained in a 
solution containing 100 units of Lul restriction endo- 

15 nuclease (Boehringer Mannheim, Indianapolis, IN) in 200 ul 
of a buffer supplied by the endonuclease manufacturer for 
1.5 hours at 37 °C. The solution was then extracted with 
a mixture of phenol and chloroform. The DNA was then 
ethanol precipitated and resuspended in 100 ul of water. 

20 This solution was admixed with 100 units of the restric- 
tion endonuclease EcoR I (Boehringer Mannheim, 
Indianapolis, IN) in a final volume of 200 ul of buffer 
containing the components specified by the manufacturer. 
This solution was maintained at 37 °C for 1.5 hours and the 

25 solution was then extracted with a mixture of phenol and 
chloroform. The DNA was ethanol precipitated and the DNA 
resuspended in TE. 

The V H expression library prepared in Example 13 was 
amplified and 500 ug of V H expression library phage DNA 

3 0 prepared using the methods detailed above. 50 ug of the 
V H expression library phage DNA was maintained in a solu- 
tion containing 100 units of Hind III restriction endo- 
nuclease (Boehringer Mannheim, Indianapolis, IN) in 200 ul 
of a buffer supplied by the endonuclease manufacturer for 

3 5 1.5 hours at 37 °C. The solution was then extracted with 
a mixture of phenol and chloroform saturated with 0.1 
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Tris-HCL at pH 7.5. The DNA was then ethanol precipitated 
and resuspended in 100 ul of water. This solution was 
admixed with 100 units of the restriction endonuclease 
EcoR I (Boehringer Mannheim, Indianapolis, IN) in a final 
5 volume of 200 ul of buffer containing the components 
specified by the manufacturer. This solution was main- 
tained at 37 °C for 1.5 hours and the solution was then 
extracted with a mixture of phenol and chloroform. The 
DNA was ethanol precipitated and the DNA resuspended in 
10 TE. 

The restriction digested V H and V L II -express ion 
Libraries were ligated together. The ligation reaction 
consisted of 1 ug of V H and 1 ug of V L II phage library DNA 
in a 10 ul reaction using the reagents supplied in a liga- 

15 tion kit purchased from Stratagene Cloning Systems (La 
Jolla, CA) . After ligation for 16 hr at 4°C, 1 ul of the 
ligated the phage DNA was packaged with Gigapack Gold II 
packaging extract and plated on XL 1-blue cells prepared 
according the manufacturer's instructions. A portion of 

20 the 3X10 6 clones obtained were used to determine the 
effectiveness of the combination. The resulting V H and V L 
expression vector is shown in Figure 11. 

Clones containing both V H and V L were excised from the 
phage to pBluescript using the in vitro excision protocol 

25 described by Short et al., Nucleic Acid Research , 16L7583- 
7600 (1988) . Clones chosen for excision expressed the 
decapeptide tag and did not cleave X-gal in the presence 
of 2mM IPTG, thus remaining white. Clones with these 
characteristics represented 30% of the library. 50% of 

3 0 the clones chosen for excision contained a V H and V L as 
determined by restriction analysis. Since approximately 
3 0% of the clones in the V H library expressed the decapep- 
tide tag and 50% of the clones in the V L II library 
contained a V L sequence it was anticipated that no more 

35 than 15% of the clones in the combined library would 
contain both V H and V L clones. The actual number obtained 
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was 15% of the library indicating that the process of 
combination was very efficient. 

16. Segregating DNA Homologs For a V„ Antigen Binding 
Protein 

5 To segregate the individual clones containing DNA 

homologs that code for a V H antigen binding protein, the 
titre of the V H expression library prepared according to 
Example 12 was determined. This library titration was 
performed using methods well known to one skilled in the 

10 art. Briefly, serial dilutions of the library were made 
into a buffer containing 100 mM NaCl, 50 mM Tris-HCL at 
pH 7.5 and 10 mM MgCl 4 , 5 g/L yeast extract, 10 g/L NZ 
amine (casein hydrolysate) and 0.7% melted, 50 °C agarose. 
The phage, the bacteria and the top agar were mixed and 

15 then evenly distributed across the surface of a prewarmed 
bacterial agar plate (5 g/L NaCl, 2 g/L MgCl 4 , 5 g/L yeast 
extract, 10 g/L NZ amine (casein hydrolysate) and 15 g/L 
Difco agar. The plates were maintained at 37 °C for 12 to 
24 hours during which time period the lambda plaques 

2 0 developed on the bacterial lawn. The lambda plaques were 
counted to determine the total number of plaque forming 
units per ml in the original library. 

The titred expression library was then plated out so 
that replica filters could be made from the library. The 

2 5 replica filters will be used to later segregate out the 

individual clones in the library that are expressing the 
antigens binding proteins of interest. Briefly, a volume 
of the titred library that would yield 20,000 plaques per 
150 millimeter plate was added to 600 ul of exponentially 

3 0 growing E. coli cells and maintained at 37 °C for 15 min- 

utes to allow the phage to absorb to the bacterial cells. 
The 7.5 ml of top agar was admixed to the solution 
containing the bacterial cells and the absorbed phage and 
the entire mixture distributed evenly across the surface 
35 of a prewarmed bacterial agar plate. This process was 
repeated for a sufficient number of plates to plate out a 
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total number of plaques at least equal to the library 
size. These plates were then maintained at 37 °C for 5 
hours. The plates were then overlaid with nitrocellulose 
filters that had been pretreated with a solution contain- 
5 ing 10 mM isopropyl-beta-D-thiogalactopyanoside (IPTG) and 
maintained at 37 °C for 4 hours. The orientation of the 
nitrocellulose filters in relation to the plate were 
marked by punching a hole with a needle dipped in water- 
proof ink through the filter and into the bacterial plates 

10 at several locations. The nitrocellulose filters were 
removed with forceps and washed once in a TBST solution 
containing 20 mM Tris-HCl at pH 7.5, 150 mM NaCl and 0.05% 
monolaurate (tween-20) . A second nitrocellulose filter 
that had also been soaked in a solution containing 10 mM 

15 IPTG was reapplied to the bacterial plates to produce 
duplicate filters. The filters were further washed in a 
fresh solution of TBST for 15 minutes. Filters were then 
placed in a blocking solution consisting 20 mM Tris-HCl at 
pH 7.5, 150 mM NaCl and 1% BSA and agitated for 1 hour at 

2 0 room temperature. The nitrocellulose filters were trans- 

ferred to a fresh blocking solution containing a 1 to 500 
dilution of the primary antibody and gently agitated for 
at least 1 hour at room temperature. After the filters 
were agitated in the solution containing the primary 
25 antibody the filters were washed 3 to 5 times in TBST for 
5 minutes each time to remove any of the residual unbound 
primary antibody. The filters were transferred into a 
solution containing fresh blocking solution and a 1 to 500 
to a 1 to 1,000 dilution of alkaline phosphatase conju- 

3 0 gated secondary antibody. The filters were gently 

agitated in the solution for at least 1 hour at room 
temperature. The filters were washed 3 to 5 times in a 
solution of TBST for at least 5 minutes each time to 
remove any residual unbound secondary antibody. The 
3 5 filters were washed once in a solution containing 20 mM 
Tris-HCl at pH 7.5 and 150 mM NaCl. The filters were 
removed from this solution and excess moisture blotted 
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from them with filter paper. The color was developed by 
placing the filter in a solution containing 100 mM Tris- 
HC1 at pH 9.5, 100 mM NaCl , 5 mM MgCl 2 , 0.3 mg/ml of nitro 
Blue Tetrazolium (NBT) and 0.15 mg/ml of 5-bromo-4-chloro- 
5 3 -indolyl -phosphate (BCIP) for at least 30 minutes at room 
temperature. The residual color development solution was 
rinsed from the filter with a solution containing 20 mM 
Tris-HCl at pH 7.5 and 150 mM NaCl. The filter was then 
placed in a stop solution consisting of 20 mM Tris-HCl at 
10 pH 2.9 and 1 mM EDTA. The development of an intense 
purple color indicates a positive result. The filters are 
used to locate the phage plaque that produced the desired 
protein. That phage plaque is segregated and then grown 
up for further analysis. 
15 Several different combinations of primary antibodies 

and second antibodies were used. The first combination 
used a primary antibody immunospecif ic for a decapeptide 
that will be expressed only if the V H antigen binding 
protein is expressed in the proper reading frame to allow 
20 read through translation to include the decapeptide epi- 
tope covalently attached to the V H antigen binding protein. 
This decapeptide epitope and an antibody immunospecif ic 
for this decapeptide epitope was described by Green et 
al. f Cell 28:477 (1982) and Niman et al., Proc. Nat. Acad. 
25 Sci. U.S.A. 80:4949 (1983). The sequence of the decapep- 
tide recognized is shown in Figure 11. A functional 
equivalent of the monoclonal antibody that is immuno- 
specif ic for the decapeptide can be prepared according to 
the methods of Green et al. and Niman et al. The secon- 
3 0 dary antibody used with this primary antibody was a goat 
antimouse IgG (Fisher Scientific) . This antibody in 
immunospecif ic for the strand region of mouse IgG and did 
not recognize any portion of the variable region of heavy 
chain. This particular combination of primary and secon- 
3 5 dary antibodies when used according to the above protocol 
determined that between 25% and 3 0% of the clones were 
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expressing the decapeptide and therefore these clones were 
assumed to also be expressing a V H antigen binding protein. 

In another combination the anti-decapeptide mouse 
monoclonal was used as the primary antibody and an aff in- 
5 ity purified goat anti-mouse Ig, commercially available as 
part of the picoBlue immunoscreening kit from Stratagene 
Cloning System, La Jolla, CA, was use as the secondary 
antibody. This combination resulted in a large number of 
false positive clones because the secondary antibody also 

10 immunoreacted with the V H of the heavy chain. Therefore 
this antibody reacted with all clones expressing any V H 
protein and this combination of primary and secondary 
antibodies did not specifically detect clones with the V H 
polynucleotide in the proper reading frame and thus 

15 allowing expressing of the decapeptide. 

Several combinations of primary and secondary anti- 
bodies are used where the primary antibody is conjugated 
to fluorescein isothiocyanate (FITC) and thus the immuno- 
specificity of the antibody was not important because the 

20 antibody is conjugated to the preselected antigen (FITC) 
and it is that antigen that should be bound by the V H 
antigen binding proteins produced by the clones in the 
expression library. After this primary antibody has bound 
by virtue that is FITC conjugated mouse monoclonal anti- 

25 body p2 5764 (ATCC #HB-9505) . The secondary antibody used 
with this primary antibody is a goat anti-mouse Ig 6 (Fisher 
Scientific, Pittsburgh, PA) conjugated to alkaline phos- 
phatase using the method described in Antibodies; A 

Laboratory Manual , Harlow and Lowe, eds., Cold Spring 

3 0 Harbor, New York, (1988) . If a particular clone in the V H 
expression, library, expresses a V H binding protein that 
binds the FITC covalently coupled to the primary antibody, 
the secondary antibody binds specifically and when devel- 
oped the alkaline phosphate causes a distinct purple color 

3 5 to form. 

The second combination of antibodies of the type uses 
a primary antibody that is FITC conjugated rabbit anti- 
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human IgG (Fisher Scientific, Pittsburgh, PA) . The secon- 
dary antibody used with this primary antibody is a goat 
anti-rabbit IgG conjugated to alkaline phosphatase using 
the methods described in Antibodies A Laboratory Manual , 
5 Harlow and Lane, eds., Cold Spring Harbor, New York, 
(1988) . If a particular clone in the V H expression library 
expresses a V H binding protein that binds the FITC conju- 
gated to the primary antibody, the secondary antibody 
binds specifically and when developed the alkaline 

10 phosphatase causes a distinct purple color to form. 

Another primary antibody was the mouse monoclonal 
antibody p2 5764 (ATCC # HB-9505) conjugated to both FITC 
and 125 I. The antibody would be bound by any V H antigen 
binding proteins expressed. Then because the antibody is 

15 also labeled with 125 I, an autoradiogram of the filter is 
made instead of using a secondary antibody that is conju- 
gated to alkaline phosphatase. This direct production of 
an autoradiogram allows segregation of the clones in the 
library expressing a V H antigen binding protein of 

20 interest. 

17. Segregati ng DNA Homologs For a v„ and V L that Form an 
Antigen Binding F v 

To segregate the individual clones containing DNA 
homologs that code for a V H and V L that form an antigen 

25 binding F v , an V H and V L expression library was titred 
according to Example 15. The titred expression library 
was then screened for the presence of the decapeptide tag 
expressed with the V H using the methods described in 
Example 16. DNA was then prepared from the clones to 

3 0 express the decapepide tag. This DNA was digested with 
the restriction endonuclease Pvu II to determine whether 
these clones also contained a V L DNA homolog. The slower 
migration of a PvuII restriction endonuclease fragment 
indicated that the particular clone contained both a V H and 

35 a V L DNA homolog. 
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The clones containing both a V H and a V L DNA homolog 
were analyzed to determine whether these clones produced 
an assembled F v protein molecule from the V H and V L DNA 
homologs . 

5 The F v protein fragment produced in clones containing 

both V H and V L was visualized by immune precipitation of 
radiolabeled protein expressed in the clones. A 50 ml 
culture of LB broth (5 g/L yeast extract, 10 g/L and tryp- 
tone 10 g/L NaCl at pH 7.0) containing 100 ug/ul of 

10 ampicillin was inoculated with E. Coli harboring a plasmid 
contain a V H and a V L . The culture was maintained at 37 °C 
with shaking until the optical density measured at 550 nm 
was 0.5. The culture then was centrifuged at 3,000 g for 
10 minutes and resuspended in 50 ml of M9 media (6 g/L 

15 Na 2 HP0 4/ 3 g/L KH 2 P0 4 , 0.5 g/L NaCl, 1 g/L NH 4 C1, 2g/L 
glucose, 2 mM MgS0 4 and 0.1 mMgS0 4 CaCl 2 supplemented with 
amino acids without methionine or cysteine. This solution 
was maintained at 37 °C for 5 minutes and then 0.5 mCi of 
35 S as HS0 4 (New England Nuclear, Boston, MA) was added and 

2 0 the solution was further maintained at #&C for an addi- 

tional 2 hours. The solution was then centrifuged at 300 
x g and the supernatant discarded. The resulting bacter- 
ial cell pellet was frozen and thawed and then resuspended 
for 10 minutes and the resulting pellet discarded. The 
25 supernatant was admixed with 10 ul of anti-decapeptide 
monoclonal antibody and maintained for 3 0-90 minutes on 
ice. 4 0 ul of protein G coupled to sepharose beads 
(Pharmacia, Piscataway, NJ) was admixed to the solution 
and the added solution maintained for 30 minutes on ice to 

3 0 allow an immune precipitate to form. The solution was 

centrifuged at 10,000 x g for 10 minutes and the resulting 
pellet was resuspended in 1 ml of a solution containing 
100 mM Tris-HCl at Ph 7.5 and centrifuged at 10,000 x g 
for 10 minutes. This procedure was repeated twice. The 
3 5 resulting immune precipitate pellet was loaded onto a 
PhastGel Homogenous 2 0 gel (Pharmacia, Piscataway, NJ) 
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according to the manufacturer's directions. The gel was 
dried and used to expose X-ray film. 

The resulting autoradiogram is shown in Figure 12 . 
The presence of V L that was immunoprecipitated because it 
5 was attached to the V H -decapepide tag recognized by the 
precipitating antibody. 

18. Generation of a Combinatorial Library of the 

Immunoglobulin Repertoire in Phage 

Vectors suitable for expression of V H , V L/ Fv and Fab 
10 sequences are diagrammed in Figures 7 and 9. As previ- 
ously discussed, the vectors were constructed by modifi- 
cation of Lambda Zap by inserting synthetic oligonucleo- 
tides into the multiple cloning site. The vectors were 
designed to be antisymmetric with respect to the Not I and 
15 EcoR I restriction sites which flank the cloning and 
expression sequences. As described below, this anti- 
symmetry in the placement of restriction sites in a linear 
vector like bacteriophage allows a library expressing 
light chains to be combined with one expressing heavy 

2 0 chains to construct combinatorial Fab expression 

libraries. Lambda Zap II V L II (Figure 9) is designed to 
serve as a cloning vector for light chain fragments and 
Lambda Zap II V H (Figure 7) is designed to serve as a 
cloning vector for heavy chain sequences in the initial 
25 step of library construction. These vectors are engi- 
neered to efficiently clone the products of PCR 
amplification with specific restriction sites incorporated 
at each end. 

A. PCR Amplification of Antibody Fragments 

3 0 The PCR amplification of mRNA isolated from spleen 

cells with oligonucleotides which incorporate restriction 
sites into the ends of the amplified product can be used 
to clone and express heavy chain sequences including Fd 
and kappa chain sequences. The oligonucleotides primers 
35 used for these amplifications are presented in Tables 1 
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and 2. The primers are analogous to those which have been 
successfully used in Example 6 for amplifications of V H 
sequences. The set of 5" primers for heavy chain ampli- 
fication were identical to those previously used to 
5 amplify V H and those for light chain amplification were 
chosen on similar principles, Sastry et al., Proc. Natl. 
Acad. Sci. USA . 8G: 5728 (1989) and Orland et al. , Proc 
Natl. Acari- Sci. USA . 8G:3833 (1989). The unique 3' 
primers of heavy (IgGl) and light (k) chain sequences were 

10 chosen to include the cysteines involved in heavy-light 
chain disulfide bond formation. At this stage no primer 
was constructed to amplify lambda light chains since they 
constitute only a small fraction of murine antibodies. In 
addition, Fv fragments have been constructed using a 3» 

15 primer which is complementary to the to the mRNA in the J 
(joining) region (amino acid 128) and a set of unique 5' 
primers which are complementary to the first strand cDNA 
in the conserved N-terminal region of the processed 
protein. Restriction endonuclease recognition sequences 

2 0 are incorporated into the primers to allow for the cloning 

of the amplified fragment into a lambda phage vector in a 
predetermined reading frame for expression. 

B. Library Construction 

The construction of a combinatorial library was 
25 accomplished in two steps. In the first step, separate 
heavy and light chain libraries were constructed in Lambda 
Zap II V H and Lambda Zap II V L II respectively. In the 
second step, these two libraries were combined at the 
antisymmetric EcoRl sites present in each vector. This 

3 0 resulted in a library of clones each of which potentially 

co-expresses a heavy and a light chain. The actual combi- 
nations are random and do not necessarily reflect the 
combinations present in the B-cell population in the 
parent animal. Lambda Zap II V H expression vector has been 
35 used to create a library of heavy chain sequences for DNA 
obtained by PCR amplifications of mRNA isolated from the 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



116 

spleen of a 129 G ix + mouse previously immunized with p- 
nitrophenyl phosphonamidate (NPN) antigen 1 according to 
formula I (Figure 13) conjugated to keyhole limpet 
hemocyanin (KLH) . 
5 The NPN— KLH conjugate was prepared by admixture of 

250 ul of a solution containing 2.5 mg of NPN according to 
formula 1 (Figure 12) in dimethyl formamide with 750 ul of 
a solution containing 2 mg of KLH in 0.01 M sodium phos- 
phate buffer (pH 7.2). The two solutions were admixed by 

10 slow addition of the NPN solution to the KLH solution 
while the KLH solution was being agitated by a rotating 
stirring bar. Thereafter the admixture was maintained at 
4°C for 1 hour with the same agitation to allow conjuga- 
tion to proceed. The conjugated NPN-KHL was isolated from 

15 the nonconjugated NPN and KLH by gel filtration through 
Sephadex G-25. The isolated NPN-KLH conjugate was used in 
mouse immunizations as described in Example 3 . 

The spleen mRNA resulting from the above immuniza- 
tions was isolated and used to create a primary library of 

20 V H gene sequences using the Lambda Zap II V H expression 
vector. The primary library contains 1.3 x 10 6 pfu and has 
been screened for' the expression of the decapeptide tag to 
determine the percentage of clones expressing Fd 
sequences. The sequence for this peptide is only in frame 

25 for expression following the cloning of an Fd (or V H ) 
fragment into the vector. At least 80% of the clones in 
the library express Fd fragments based on immuno-detection 
of the decapeptide tag. 

The light chain library was constructed in the same 

30 way as the heavy chain and shown to contain 2.5 x 10 6 
members. Plaque screening, using the anti-kappa chain 
antibody, indicated that 60% of the library contained 
expressed light chain inserts. This relatively small 
percentage of inserts probably resulted from incomplete 

35 dephosphorylation of vector after cleavage with Sac I and 
Xba I. 
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Once obtained, the two libraries were used to 
construct a combinatorial library by crossing them at the 
EcoRI site. To accomplish the cross, DNA was first 
purified from each library • The light chain library was 
5 cleaved with Mlul restriction endonuclease, the resulting 
5 1 ends dephosphorylated and the product digested with 
EcoRI. This process cleaved the left arm of the vector 
into several pieces but the light arm containing the light 
chain sequences, remained intact. In a parallel fashion, 

10 the DNA of heavy chain library was cleaved with Hindlll, 
dephosphorylated and cleaved with EcoR I, destroying the 
right arm but leaving the left arm containing the heavy 
chain sequences intact. The DNA 1 s so prepared were then 
combined and ligated. After ligation only clones which 

15 resulted from combination of a right arm of light chain- 
containing clones reconstituted a viable phage. After 
ligation and packaging, 2.5 x 10 7 clones were obtained. 
This is the combinatorial Fab expression library that was 
screened to identify clones having affinity for NPN. To 

20 determine the frequency the phage clones which co-express 
the light and heavy chain fragments, duplicate lifts o the 
light chain, heavy chain and combinatorial libraries were 
screened as above for light and heavy chain expression. 
In this study of approximately 500 recombinant phage 

25 approximately 60% co-expressed light and heavy chain 
proteins . 

C. Antigen Binding 

All three libraries, the light chain, the heavy chain 

* and Fab were screened* to determine if they contained 
3 0 recombinant phage that expressed antibody fragments bind- 

* ing NPN. In a typical procedure 30,000 phage were plated 
and duplicate lifts with nitrocellulose screened for 
binding to NPN coupled to 125 I labeled BSA (Figure 15) . 
Duplicate screens of 80,000 recombinant phage from the 

35 light chain library and a similar number from the heavy 
chain library did not identify any clones which bound the 
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antigen. In contrast, the screen of a similar number of 
clones from the Fab expression library identified many 
phage plaques that bound NPN (Figure 15) . This observa- 
tion indicates that under conditions where many heavy 
5 chains in combination with light chains bind to antigen 
the same heavy or light chains alone do not. Therefore, 
in the case of NPN, it is believed that there are many 
heavy and light chains that only bind antigen when they 
are combined with specific light and heavy chains 

10 respectively. 

To assess the ability to screen large numbers of 
clones and obtain a more quantitative estimate of the 
frequency of antigen binding clones in the combinatorial 
library, one million phage plaques were screened and 

15 approximately 100 clones which bound to antigen were 
identified. For six clones which were believed to bind 
NPN, a region of the plate containing the positive and 
approximately 20 surrounding bacteriophage plaques was 
"cored", replated, and screened with duplicate lifts 

20 (Figures 15) . As expected, approximately one in twenty of 
the phage specifically bind to antigen. "Cores" of 
regions of the plated phage believed to be negative did 
not give positives on replating. 

To determine the specificity of the antigen-antibody 

25 interaction, antigen binding was competed with free 
unlabeled antigen as shown in Figure 16. Competition 
studies showed that individual clones could be distin- 
guished on the basis of antigen affinity. The concentra- 
tion of free hapten required for complete inhibition of 

3 0 binding varied between 10-100 x 10- 9 M suggestion that the 
expressed Fab fragments had binding constants in the 
nanomolar range. 

D. Composition of the Clones and Their Expressed 
Products 

3 5 In preparation for characterization of the protein 

products able to bind NPN as described in Example 19C, a 
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plasmid containing the heavy and light chain genes was 
excised from the appropriate "cored" bacteriophage plaque 
using M13mp8 helper phage. Mapping of the excised plasmid 
demonstrated a restriction pattern consistent with incor- 
5 poration of heavy and light chain sequences. The protein 
products of one of the clones was analyzed by ELISA and 
Western blotting to establish the composition of the NPN 
binding protein. A bacterial supernate following IPTG 
induction was concentrated and subjected to gel filtra- 

10 tion. Fractions in the molecular weight range 40-60 kD 
were pooled, concentrated and subjected to a further gel 
filtration separation. As illustrated in Figure 17, ELISA 
analysis of the eluting fractions demonstrated that NPN 
binding was associated with a protein of molecular weight 

15 about 50 kD which immunological detection showed contained 
both heavy and light chains. A Western blot (not shown) 
of a concentrated bacterial supernate preparation under 
non-reducing conditions was developed with anti- 
decapeptide antibody. This revealed a protein band of 

2 0 molecular weight of 50 kD. Taken together these results 

are consistent with NPN binding being a function of Fab 
fragments in which heavy and light chains are covalently 
linked. 

20. Flp recombinase-catalvzed Recombination 
25 Experiments directed to the in vivo recombination of 

two lambda vectors using flp recombinase-catalyzed recom- 
bination are described. The flp recombination site was 
introduced into the phage vectors using 39mer synthetic 
oligonucleotides. The sequence of the flp site utilized 

3 0 for recombination was derived from several references 

( e.g. Senecoff et al . , Proc. Nat. Acad. Sci. USA 82:7220- 
7224 (1985)). The Xbal site within the 8bp core was elim- 
inated as this site was to be used in the cloning 
strategy. This was accomplished by making a point muta- 
3 5 tion which has little or no effect on its ability to allow 
recombination (McLeod et al . . Mol. Cell. Biol. 6:3357- 
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3367 (1985)). However, this point mutation is not 
required for the system to function • The oligonucleotides 
were further designed to be inserted in the EcoRl sites of 
the Lambda Zap II V H and Lambda Zap II V L vectors so that 
5 only one flanking EcoRl sites would be regenerated (see 
Figure 18) . The flanking sequences are not essential to 
the system. 

The following sequences were inserted into Lamba Zap II V H : 

(110) Oligo 79 EcoRl 
10 AATTCGAAGTTCCTATTCTCTAAAAAGTATAGGAACTTC 3 ■ 

(111) Oligo 80 GCTTCAAGGATAAGAGATTTTTCATATCCTTGAAGTTAA 
5 1 

The following sequences were inserted into Lambda Zap II 

15 (112) O 1 i g o 8 1 

AATTGAAGTTCCTATTCTCTAAAAAGTATAGGAACTTCG EcoRl 3' 
( 113 ) Oligo 82 CTTCAAGGATAAGAGATTTTTCATATCCTTGAAGCTTAA 

5 f 

Vectors were constructed as follows. The first two 

2 0 oligonucleotides were mixed (0.5 jzg oligo 79, 0.5 fig oligo 

80, 1 /il 200 mM Tris, pH 7.4, 20 mM MgCl, 500 mM NaCl, and 
H 2 0 to 10 Ml) / heated to 85 °C 5 min. , and allowed to cool 
to room temperature over 1 hour in a water bath. The 
procedure was repeated using oligos 81 and 82. 

25 Ligation into vector arms was accomplished by 

digesting Lambda Zap V H and Lambda Zap II V L with 3U//xg 
EcoRl according to standard digestion procedure. After 
phenol/ chloroform extraction, DNA was precipitated with 
EtOH. The vector was not phosphatase treated so that the 

30 oligonucleotides could be inserted without kinase treat- 
ment, thus preventing multiple tandem oligonucleotide 
inserts. Ligations were performed in 5 ill volumes using 
1 jug of lambda DNA and 1 ng of annealed oligonucleotides 
according to standard ligation protocol (see Maniatis et 

3 5 al . . supra ) . Ligation mixes were packaged using Gigapack 
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Gold™ (Stratagene Cloning Systems, San Diego , CA) 
according to the protocol recommended in the manual. 

Following packaging, the vectors were screened. 
Packaged DNA was plated according to the Gigapack Gold™ 
5 manual procedure on NZY agar with approximately 400 pfu 
per 100 mm Petri dish. Duplicate plaque lifts were done 
according to the protocol in the Predigested ZapII Cloning 
Manual (Stratagene Cloning Systems, San Diego, CA) on 
nitrocellulose filters. Denaturation and fixation of DNA 

10 onto the membranes is also described in the manual. 
Prehybridization was performed according to pBluescript II 
Exo/Mung DNA Sequencing System™ instruction manual 
(Stratagene Cloning Systems, San Diego, CA) for oligo- 
nucleotide probes (pg 6) . Hybridization was performed 

15 overnight using 32 P kinased oligo 79 (0.5 x 10 6 cpm/ml) 
according to the pBluescript manual (Stratagene Cloning 
Systems, San Diego, CA) . Oligo 79 was kinased using 
standard 32 P gamma ATP labelling techniques (see Maniatis 
et al. , supra). Filters were washed in 6X SSC, 0.1% SDS, 

20 3 times at room temperature, once at 55 °C and finally at 
59 °C. Each was washed for approximately 10 minutes. 
Positive plaques were identified using X-ray autoradio- 
graphy. Twelve duplicate plaques were cored in 500 fxl SM, 
2 0 ill chloroform. These plaques were sufficiently well 

25 isolated that secondary screening was not required. The 
cored plaques were excised according to the Predigested 
Zap II Cloning Manual (Stratagene Cloning Systems, San 
Diego, CA) and DNA from single ampicillin resistant 
colonies was sequenced using minipreped DNA and the T7 and 

3 0 T3 primers according to the DSK 3 5S Sequencing kit 
(Stratagene Cloning Systems, San Diego, CA) . Clones with 
flp sites in the correct orientation and opposite orien- 
tation were identified, amplified and titred. One of each 
type of clone (FLPHC+, FLPHC-, FLPLC+, FLPLC-) was used to 

35 test in vivo flp- mediated recombination. 

In vivo flp-mediated recombination was accomplished 
as follows. Flp recombinase was expressed off the tac 
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promoter on a plasmid, pCS3, in B. coli MM294 strain 
(Lebreton et al. 1988) . This is a low copy number plasmid 
with the pACYC origin of replication and contains a 
chloramphenicol resistance gene. 
5 5 x 10 8 cells were coinfected with FLPHC and FLPLC 

vectors at an moi of 5 and 10 pfu each per cell. 
Combinations of FLPHC+ and FLPLV+, or FLPHC- and FLPLC- , 
or FLPHC+ and FLPLC- were tested. 

Overnight cultures of MM294(pCS3) were grown in LB, 

10 spun down and resuspended in lOmM MgSO A at a density of 
OD 600 = 1.0. The appropriate amounts of phage were added 
to 0.5 ml of cells and allowed to adhere at 37 °C for 15 
minutes. 50 ml of NZY was added to each flask and incu- 
bated for 2 hours with shaking. 250 /il of chloroform was 

15 added to 25 ml of lysate and incubated for 15 minutes at 
room temperature. The supernatants were titred and 
screened for phage containing both Lambda Zap II V H left 
arms and Lambda Zap II V L right arms. Probes to identify 
Zap II V H left arms and Lambda Zap II V L right arms were 

2 0 designed by identifying unique sequences from the known 

sequence of the vectors. 

The Lambda Zap II V H left arm probe had the following 
sequence: 

(114) CTAGTTACCCGTACGACCCCCCCGTTCCGGACTACGCTTCTTAATAG 3 1 
25 This sequence hybridizes to the decapeptide sequence of 
the Lambda Zap II V H . The Lambda Zap II V L right arm probe 
had the following sequence: 

( 115 ) 5 1 GAG CTCGT C AGTT CT AG AGTT AAG CGG C CG 3 1 
This sequence hybridizes to the sequence from the Sacl 
30 site to the former Notl site of the Lambda Zap II V L 
vector . 

The screening procedure used was the same as that 
used to identify the flp vectors, as described above, with 
the exception of washing conditions. Filters were washed 

3 5 with 6XSSC, 1%SDS 3 times at room temperature and twice at 

60 °C. Plaques which hybridized to both probes were 
identified by X-ray autoradiography, cored, excised and 
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digested to determine if recombination had occurred. 
Control plaques identified as hybridizing to only one 
probe and to neither probe were also cored. Diagnostic 
restriction digests were PvuII, Pvul, Xhol, Xhol/Pvul, 
5 Sacl, Sac/PVul, Notl, Xbal, Seal, Spel, Spel/Puul. 
Restriction digest results verified that recombination at 
the flp site occurred in vivo in cells expressing the flp 

recombinase gene and not in control SURE™ E. coli cells 

(Stratagene Cloning Systems, San Diego, CA) which do not 

10 normally express flp recombinase. 

Efficiency of recombination according to the number 
of plaques identified as hybridizing to both probes was 
initially between about 5-10%. Changes to the protocol 
can be made, however, which will improve the efficiency of 

15 recovery of recombined vectors. For example, by adding 
selectable marker sequences to the left and right arms of 
the vectors, up to 100% of target recombinants can be 
identified (Figure 20) . Adding selection systems to 
ensure that all recombinants contain inserts will also 

2 0 increase the efficiency of identifying the desired clones. 

In Example 19 a relatively restricted library was 
prepared because only a limited number of primers were 
used for PCR amplification of Fd sequences. The library 
is expected to contain only clones expressing kappa/gamma 
25 sequences. However, this is not an inherent limitation of 
the method since additional primers can be added to 
amplify any antibody class or subclass. Despite this 
restriction we were able to isolate a large number of 
antigen binding clones. Of interest is how a phage 

3 0 library prepared as described herein compares with the in 

vivo antibody repertoire in terms of size, characteristics 
of diversity, and ease of access. 

The size of the mammalian antibody repertoire is 
difficult to judge but a figure of the order of 10 6 -10 8 
3 5 different antigen specificities is often quoted. With 
some of the reservations discussed below, a phage library 
of this size or large can readily be constructed by a 
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modification of the current method. In fact once an 
initial combinatorial library has been constructed, heavy 
and light chains can be shuffled to obtain libraries of 
exceptionally large numbers. 
5 In principle, the diversity characteristics of the 

naive (unimmunized) in vivo repertoire and corresponding 
phage library are expected to be similar in that both 
involve a random combination of heavy and light chains. 
However, different factors will act to restrict the 
10 diversity expressed by an in vivo repertoire and phage 
library. For example a physiological modification such as 
tolerance will restrict the expression of certain anti- 
genic specificities from the in vivo repertoire but these 
specificities may still appear in the phage library. For 
15 example, the representation of mRNA for sequences 
expressed by stimulated B-cells can be expected to 
predominate over those of unstimulated cells because of 
higher levels of expression. Different source tissues 
( e.g. , peripheral blood, bone marrow or regional lymph 
20 nodes) and different PGR primers ( e.g. . ones expected to 
amplify different antibody classes) may result in library 
with different diversity characteristics. 

Another difference between in vivo repertoire and 
phage library is that antibodies isolated from the former 
25 may have benefited from affinity maturation due to somatic 
mutations after combination of heavy and light chains 
whereas the latter randomly combines the matured heavy and 
light chains. Given a large enough phage library derived 
from a particular in vivo repertoire, the original matured 
3 0 heavy and light chains will be recombined. However, since 
one of the potential benefits of this new technology is to 
obviate the need for immunization by the generation of a 
single highly diverse "generic" phage library, it would be 
useful to have methods to optimize sequences to compensate 
3 5 for the absence of somatic mutation and clonal selection. 
Three procedures are made readily available through the 
methods of the present invention. First, saturation muta- 
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genesis may be performed on the CDR 1 s and the resulting 
Fabs can be assayed for increased function. Second, a 
heavy or a light chain of a clone which binds antigen can 

> 

be recombined with the entire light or heavy chain 
5 libraries respectively in a procedure identical to the one 
used to construct the combinatorial library. Third, iter- 
ative cycles of the two above procedures can be performed 
to further optimize the affinity or catalytic properties 
of the immunoglobulin. It should be noted that the latter 
10 two procedures are not permitted in B-cell clonal selec- 
tion which suggests that the methods described here may 
actually increase the ability to identify optimal 
sequences . 

Access is the third area where it is of interest to 
15 compare the in vivo antibody repertoire and phage library. 
In practical terms the phage library is much easier to 
access. The screening methods allow one to survey at 
least 50,000 clones per plate so that 10 6 antibodies can be 
readily examined in a day. This factor alone should 

2 0 encourage the replacement of hybridoma technology with the 

methods described here. The most powerful screening 
methods utilize selection which may be accomplished by 
incorporating selectable markers into the antigen such as 
leaving groups necessary for replication of auxotrophic 
25 bacterial strains or toxic substituents susceptible to 
catalytic inactivation. There are also further advantages 
related to the fact that the in vivo antibody repertoire 
can only be accessed via immunization which is a selection 
on the basis of binding affinity. The phage library is 

3 0 not similarly restricted. For example, the only general 

method to identify antibodies with catalytic properties 
has been by pre-selection on the basis of affinity of the 
antibody to a transition state analogue. No such restric- 
tions apply to the in vivo library where catalysis can, in 
35 principle, be assayed directly. The ability to directly 
assay large numbers of antibodies for function may allow 
selection for catalysts in reactions where a mechanism is 
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not well defined or synthesis of the transition state 
analog is difficult. Assaying for catalysis directly 
eliminates the bias of the screening procedure for 
reaction mechanisms pejorative to a synthetic analog and 
5 therefore simultaneous exploration of multiple reaction 
pathways for a given chemical transformation are possible. 

Although we have given examples of several screening 
methods, it should be clear to one skilled in the art that 
alternative methods of screening, usch as by panning dels 
10 or particles expressing the protein product on their sur- 
face would essentially be equivalent. If the expressed 
gene products of interest are RNA molecules instead of 
proteins, screening could be accomplished by nucleic acid 
hybridization or by detecting some functional property of 
15 th eRNA, such as ribozyme catalysis. 

The methods disclosed herein describe generation of 
Fab fragments which are clearly different in a number of 
important respects from intact (whole) antibodies. There 
is undoubtedly a loss of a affinity in having monovalent 
20 Fab antigen binders but this can be compensated by selec- 
tion of suitably tight binders. For a number of applica- 
tions such as diagnostics and biosensors it may be prefer- 
able to have monovalent Fab fragments. For applications 
requiring Fc effector functions, the technology already 
25 exists for extending the heavy chain gene and expressing 
the glycosylated whole antibody in mammalian cells. 

The ideas presented here address the bottle neck in 
the identification and evaluation of antibodies. It is 
now possible to construct and screen at least three orders 
3 0 of magnitude more clones with mono-specificity than previ- 
ously possible. The potential applications of the method 
should span basic research and applied sciences. 

21. Oligonucleotide Primer Design for Producing 
Dicistronic DNA 

35 A method based on PCR amplification that fuses heavy 

and light chain sequences has been used to construct a 
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complete antigen binding domain of a Fab protein fragment 
composed of a heavy and a light chain. Schematic diagrams 
of an immunoglobulin molecule composed of heavy and light 
chains containing constant and variable regions is shown 
5 in Figure 1. Human heavy chain IgG and human kappa light 
chain are diagrammatically sketched in Figures 2 A and 2B, 
respectively. To accomplish this procedure, immunoglobu- 
lin heavy and light chain primers were designed to produce 
a region of homology between two polymerase chain reaction 

10 (PCR) products. The complementary regions have been shown 
to hybridize predominantly under conditions where one set 
of primers ("inside primer pair") is used in a limiting 
amount relative to the other set of primers ("outside 
primer pair") . After the 3 1 ends of the PCR products have 

15 hybridized, the DNA polymerase has been shown to extend 
the ends creating a fusion sequence carrying the unique 
sequences of both PCR fragments separated by one copy of 
region X cistronic bridge. A two-step cloning procedure 
is thus avoided. When the recombinant sequence is then 

2 0 inserted into an expression vector such as ImmunoZAP, a 

fusion production capable of simultaneously expressing the 
heavy and light chains can be produced. 

The strategy used for producing immunoglobulin heavy 
and light chain PCR dicistronic DNA is shown schematically 
25 in Figure 21. Regions of the immunoglobulin heavy chain 
coding strand are designated V H , C H 1, C H 2 , and C H 3 corres- 
ponding to functional regions in the protein. The corres- 
ponding regions of the non-coding strand are designated by 
a prime ( 1 ) . Regions V L and C L are similarly labelled for 

3 0 the kappa light chain. This procedure can also be per- 

formed using lambda light chain specific regions. A 
region, X, unrelated to the natural immunoglobulin 
sequences, is introduced into the fusion product by 
attaching X to the 5 1 ends of both of the C H 1 1 and V L 
35 inside primers. 

Overlapping oligonucleotide primers used in the 
fusion-PCR reactions to produce dicistronic DNA were 
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designed to encode the following: amino acids of 225 to 
230 of the IgG heavy chain hinge region which are common 
to all human IgG isotypes; an Spe I restriction site; two 
stop codons; a ribosome binding site; a periplasmic (pelB) 
5 leader sequence (Better, et al., Science , 240:1041-1043 
(1988); Lei, et al., J. Bacterid . . 169:4379-4383 (1988)); 
a Sac I restriction site which encodes amino acids 1 and 
2 of the mature kappa light chain; and amino acids 3 to 8 
of the mature kappa light chain. The X region was 

10 designed to contain a ribosome binding site and a pelB 
leader to ensure expression of the light chain. 
Nucleotide sequences for all human and mouse PCR primers, 
both inside and outside, are listed in Table 11. Primers 
followed by a prime ( ■ ) represent non-coding strand 

15 sequences . 



Table 11 

Human and Mouse PCR Primers 
Seq. 

Id. No. Human 

2 0 ( 117 > V H 5 ' -GTCCTGTCCGAGGTGCAGCTGCTCGAGTCTGG-3 1 

( 118 ) C H 1 1 5 ■ -AATAACAATCCAGCGGCTGCCGTAGGCAATAGGT 

ATTTCATTATGACTGTCTCCTTGCTATTAACTAG 
TACAAGATTTGGGCTC- 3 ■ 

(119) V L 5 1 -GCCTACGGCAGCCGCTGGATTGTTATTAATCGCT 

2 5 GCCCAACCTGCCATGGCTGAGCTCGTGATGACCC 

CAGTCTCC-3 ' 

(12 0) C L 1 5 1 -TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA 

GCTCTTTGTGACGGGCGAACTC-3 f 

Mouse 

30 (121) V H 5 1 — AGGT C C AG CTG CT CG AGT CTGG - 3 ■ 

(122) C H 1 « 5 ■ — AATAACAATCCAGCGGCTGCCGTAGGCAATAGG 

TATTTCATTATGACTGTCTCCTTGCTATTAACT 
AGTATACAATCCCTGGGCACAAT- 3 1 

(123) V L 5 ' -GCCTACGGCAGCCGCTGGATTGTTATTAATCGC 

3 5 TGCCCAACCTGCCATGGCTGAGCTCGTGATGAC 

CCAGTCTCC-3 1 
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(124) C L 1 5 1 -TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA-3 1 

The overlapping regions of the human C H l f inside and 
V L inside primers are illustrated in Figure 22. The heavy 
5 chain downstream C H 1 1 inside primer sequence is written 3 1 
to 5 f and the light chain upstream V L inside primer 
sequence is written 5 ' to 3 1 . The complementary PCR 
product strands, and not the primer strands, cross-prime 
to create the dicistronic molecule. Bold nucleotides 

10 represent regions where the C H 1' inside primer hybridizes 
to the 3 1 end of C H 1 on human IgG heavy chain mRNA or where 
the V L inside primer hybridizes to the 5 1 end of V L frame- 
work on human kappa light chain cDNA. The amino acid and 
nucleotides in italics represent changes in sequence from 

15 the original pelB leader sequence. 

At amino acid 15 of the pelB leader sequence, the 
codon was changed from CTC to ATC resulting in a conserv- 
ative amino acid change from a leucine to an isoleucine as 
shown in Figure 22 and Table 11. Hydrophobic amino acids 

2 0 in the core region of periplasmic leader sequences have 

been shown to be essential for correct processing of the 
leader sequence and transport of the mature protein to the 
periplasm. Oliver, in Neidhardt, R.C. (ed.), Escherichia 
coli and Salmonella Tvohimurium. , Am. Soc. Microbiol., 

25 1:56-69 (1987) . The nucleotide changes were made to allow 
for the artifactual insertion of one or two dATPs at the 
3 1 end of the overlapping dicistronic molecules. Thermus 
aquaticus (Taq) DNA polymerase may add a dATP to the 3 1 
end of the PCR product because of terminal transferase 

30 activity. Jiang, etgal. Oncogene , 4:923-928 (1989). The 
additional dATP would then cause a mismatch between the 
overlapping PCR products at the 3 1 terminus and inhibit 
elongation by Taq DNA polymerase. Sommer, et al. Nucl. 
Acids Res. , 17:6749 (1989). Therefore, the change to two 

3 5 dTTPs in this position of the oligonucleotide primers 

would allow proper base pairing if up to two dATPs were 
added to the 3' terminus of the heavy chain PCR product. 
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The kappa light chain PCR product was designed to termi- 
nate at a position where two dTTPs occur 5 1 of the end of 
the product and did not require alterations of the nucleo- 
tide sequence. Nucleotides were changed in the kappa 
5 light chain primer encoding the pelB leader sequence with- 
out introducing amino acid changes in order to decrease 
the number of mismatches between the primer and the leader 
sequence of the kappa light chain mRNA as shown in Figure 
22 and Table 11. 
10 All primers were synthesized on an Applied Biosystems 

DNA synthesizer, Model 381A, following the manufacturer's 
instructions . 

22. Preparation of a V„-and V L -Codina Repertoire 
A. Preparation of a V u -and V L -Codincr REpertoire from a 
15 Human cDNA Combinatorial Library 

Cloned DNA, previously isolated from a combinatorial 
library that encodes human Fab fragments which bind 
tetanus toxoid (TT) was used as a template for preparing 
a V H -and V L -coding repertoire. Mullinax, et al., supra . 
20 Briefly, the combinatorial library was prepared by the 
following approach. Volunteer donors, who had been pre- 
viously immunized against tetanus but had not received 
booster injections within the last year, received injec- 
tions on 2 consecutive days of 0.5 milliliters (ml) of 
25 alum-absorbed tetnus toxoid (TT) (40 microgram/ml (ug)/ml) 
(Connaught Laboratories, Swiftwater, Pennsylvania) . 

One hundred ml of blood was drawn from the volunteers 
6 days post injection and anticoagulated with a mixture of 
0.14 M citric acid, 0.2 M trisodium citrate, and 0.22 M 
3 0 dextrose. The peripheral blood lymphocytes (PBLs) were 
recovered and isolated from the whole blood by layering 
the whole blood on Histopaque-1077 (Sigma, St. Louis, 
Missouri) and centrifuging at 400 x g for 30 minutes at 25 
degrees Celsius (25 °C) . Isolated PBLs were washed twice 
3 5 with phosphate buffered saline (PBS) (150 mM sodium 
chloride and 150 mM sodium phosphate, pH 7.2 at 25 °C) . 
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Total RNA was then purified from the PBLs (10 6 B cells 
per ml blood per 100 ml of blood) for an enriched source 
of B-cell mRNA coding for antiTT IgG using an RNA isola- 
tion kit according to manufacturer's instructions 
5 (Stratagene, La Jolla, California) and also described by 
Chomczynski et al., Anal. Biochem. , 162:156-159 (1987). 
Briefly, the isolated PBLs were homogenized in 10 ml of a 
denaturing solution containing 4.0 M guanine isothiocyan- 
ate, 0.25 M sodium citrate at pH 7.0, and 0.1 M beta- 

10 mercaptoethanol . One ml of sodium acetate at a concen- 
tration of 2 M at pH 4.0 was admixed with the homogenized 
cells. Ten ml of phenol that had been previously satur- 
ated with H 2 0 was also admixed to the denaturing solution 
containing the homogenized cells. Two ml of a chloroform: 

15 isoamyl alcohol (24:1 v/v) mixture was added to this homo- 
genate. The homogenate was mixed vigorously for ten 
seconds and maintained on ice for 15 minutes. The homo- 
genate was then transferred to a thick-walled 50 ml 
polypropylene centrifuged tube (Fisher Scientific Company, 

20 Pittsburgh, Pennsylvania). The solution was centrifuged 
at 10,000 x g for 20 minutes at 4°C. The upper RNA- 
containing aqueous layer was transferred to a fresh 50 ml 
polypropylene centrifuge tube and mixed with an equal 
volume of isopropyl alcohol. This solution was maintained 

25 at -20 °C for at least one hour to precipitate the RNA. 
The solution containing the precipitated RNA was centri- 
fuged at 10,000 x g for twenty minutes at 4°C. The 
pelleted total cellular RNA was collected and dissolved in 
3 ml of the denaturing solution described above. Three ml 

3 0 of isopropyl alcohol was added to the re-suspended total 
cellular RNA and inverted to mix. This solution was main- 
tained at -20°C for at least 1 hour to precipitate the 
RNA. The solution containing the precipitated RNA was 
centrifuged at 10,000 x g for ten minutes at 4°C. The 

35 pelleted RNA was washed once with a solution containing 
75% ethanol. The pelleted RNA was dried under vacuum for 
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15 minutes and then re-suspended in diethyl pyrocarbonate 
(DEPC) treated (DEPC-H 2 0) H 2 0) . 

Messenger RNA (mRNA) was prepared from the total 
cellular RNA using methods described in Molecular Cloning 

5 A Laboratory Manual . Maniatis et al., eds., Cold Spring 
Harbor, NY, (1982). Briefly, 500 mg of the total RNA 
isolated from a PBLs prepared as described above was re- 
suspended in one ml of IX sample buffer (1 mM Tris-HCl, 
(Tris [hydroxylmethyl-aminomethane] ) pH 7 . 5 ; 0 . 1 mM EDTA 
10 (disodium ethylene diamine tetra-acetic acid) , 0.5 M NaCl) 
and maintained at 65 °C for five minutes and then on ice 
for five more minutes. The mixture was then applied to an 
oligo-dT (Stratagene) column that was previously prepared 
by washing the oligo-dT with a solution containing 10 mM 

15 Tris-HCl, pH 7.5; 1 mM EDTA, 0.5 M NaCl. The eluate was 
collected in a sterile polypropylene tube and reapplied to 
the same column after heating the eluate for five minutes 
at 65 °C. The oligo dT column was then washed with 0.4 ml 
of high salt loading buffer consisting of 10 mM Tris-HCl 

20 at pH 7.5, 500 mM sodium chloride, and 1 mM EDTA. The 
oligo dT column was then washed with 2 ml of 1 X low salt 
buffer consisting of 10 mM Tris-HCl at pH 7.5, 100 mM 
sodium chloride, and 1 mM EDTA. The messenger RNA was 
eluted from the oligo dT column with 0.6 ml of buffer 

25 consisting of 10 mM Tris-HCl at pH 7.5, and ImM EDTA. The 
messenger RNA was purified by extracting this solution 
with phenol/chloroform followed by a single extraction 
with 100% chloroform. The messenger RNA was concentrated 
by ethanol precipitation and re-suspended in DEPC H 2 0. 

3 0 The messenger RNA isolated by the above process 

contains a plurality of different V H and V L coding poly- 
nucleotides, i.e., greater than about 10 4 different V H - 
and V L -coding genes. 

Isolated RNA was converted to cDNA by a primer exten- 

35 sion reaction with a first-strand synthesis kit according 
to manufacturer's instructions (Stratagene) by using an 
oligo (dT) primer for the light chain and a specific 
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primer, C H 1 1 , for the heavy chain. Mullinax et al., supra . 
In a typical 50 pi transcription reaction, 5 ug of PBL 
mRNA in water was first hybridized (annealed) with 200 ng 
(50.0 pmol) of an oligo (dT) primer for the light chain. 
5 In a separate reaction, 5 ug of PBL mRNA in water was 
first hybridized (annealed with 2 00 ng (20 pmol) of the 
heavy chain primer, C H l f , at 65°C for five minutes. 
Subsequently, the mixture was adjusted to 0.5 mM each of 
dATP, dCTP, dGTP and dTTP, 50 mM Tris-HCl at pH 8.3, 3 mM 

10 MgCl 2 75 mM KC1 , 10 mM DTT, 2 0 units of RNase block II 
(Stratagene) , and 20 units of Moloney-Murine Leukemia 
virus reverse transcriptase (Stratagene Cloning Systems) , 
was added and the solution was maintained for 1 hour at 
37 °C. PGR amplification of the heavy and light chain 

15 sequencers was done separately using 0.25-0.5 ug of first- 
strand synthesis product as template with sets of primer 
pairs using Taq DNA polymerase as described in Example 23. 

The PCR amplified light chain DNA fragments were then 
digested with Sac I and Xba I and ligated into a modified 

2 0 Lambda Zap II vector as prepared in Example 29 to form a 
light chain ImmunoZap Library (ImmunoZAP L; Stratacyte, La 
Jolla, California) . The PCR amplified heavy chain DNA was 
digested with Spe I and Sho I and ligated into a different 
modified Lambda Zap II vector as prepared in Example 27 to 

25 form a heavy chain ImmunoZap Library (ImmunoZAP H; 
Stratacyte) . The resulting libraries were amplified and 
the resulting DNA was packaged into bacteriophage with in 
vitro packaging extract, Gigapack II gold (Stratagene) and 
used to infect E. coli strain XLl-Blue (Stratagene) . 

30 To construct a library for coexpression, the right 

art of the heavy chain library phage DNA was digested with 
Hind III, preserving the left arm of ImmunoZAP H with a 
heavy chain inserts. The left arm of the light chain 
library phage DNA was digested with Mlu I resulting in a 

35 right arm of ImmunoZAP with kappa light chain inserts. 
Both products were then digested with EcoRI and ligated 
to create a combinatorial library that encoded human Fab 
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fragments including those specific for TT. Mullinax, et 
al. , supra . 

Reactive plaques were first identified by binding to 
tetanus toxoid as described in Example 31. Bacteriophage 
5 from purified reactive plaques were then converted to the 
plasmid format by in vivo excision with R408 helper phage 
(Stratagene) following methods described in Example 31 and 
familiar to one skilled in the art. Short, et al., Nucl. 
Acids . Res . . 16:7583-7600 (1988) . The resulting purified 
10 plasmid DNA encoding heavy and light chain was then used 
in PCR reactions as described below in Example 23. 

B. Preparation of a V n - and V L -Codina Repertoire from 
mRNA from Tissues and Cells 

(i) Human 

15 Purified populations of PBLs, other lymphocytes, and 

hybridomas which express immunoglobulins including IgG, 
IgM, IgE, IgD, and IgA are used as sources for isolating 
mRNA encoding immunoglobulins. PBL's and other immuno- 
globulin expressing lymphocytes are isolated from either 

20 spleen, lymphoid tissue or plasma. Following purification 
of the cells, total RNA is then purified from these cells 
using a RNA isolation kit (Stratagene) as described in 
Example 22a. The purified RNA is then converted to cDNA 
with a first-strand synthesis kit as described in Example 

25 22a. The resultant cDNA is then used as a template in PCR 
amplification reactions as described below in Example 23 
for the production of dicistronic molecules expressing 
heavy and light chains. 

(ii) Mouse 

3 0 Populations of cells described above can be isolated 

from other mammalian sources such as mouse or rabbit. 
Both mRNA and rearranged DNA can be isolated as described 
above . and used as templates in PCR amplification reac- 
tions. cDNA synthesized from mRNA isolated from a mouse 

35 anti-human fibronectin hybridoma (ATCC, CRL-1606) was used 
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as a preferred template for the production of dicistronic 
molecules expressing heavy and light chain. 

c. Pre paration ^ a v. -C oHina Repertoire from Rearranged 
DNA 

5 Rearranged DNA isolated from PBLs, other lymphocytes, 

and hybridomas which express immunoglobulins can be used 
to prepare a V H -coding repertoire. The amplification 
procedure for preparing a V H -coding repertoire using 
rearranged DNA is performed as described in Example 23. 

10 23. Preparati on of DNA Homoloqs 

A. ^ -coding rouble Stranded DNA pomoj-ogs 

Cloned DNA, prepared in Example 22 from a combina- 
torial library that encodes human Fab fragments which bind 
tetanus toxoid (TT) , was used as a template for preparing 

15 a V H -coding double stranded DNA homolog. Human heavy 
chain, containing both the V H and C H 1 coding region and 
designated as Fd, was amplified in a PCR reaction. THe 
amplification was performed in a 100 ul reaction contain- 
ing 5 nanograms (ng) of the cloned DNA in PCR buffer 

20 consisting of the following: 10 mM Tris-HCl, pH 8.3; 50 
mM KC1, 1.5 mM MgCl 2 ; 0.001% (w/v) gelatin; 200 mM of each 
dNTP; 200 nanomolar (nM) of each primer; and 2.5 units of 
Tag DNA polymerase. The human V H outside primer and C„l' 
inside primer were used as a PCR primer pair for amplifi- 

25 cation of the heavy chain (Table 11 and Figure 21) . The 
reaction mixture was overlaid with mineral oil and sub- 
jected to 40 cycles of amplification. Each amplification 
cycle (thermocycle) involved denaturation at 94 °C for 1.5 
minutes, annealing at 54 «C for 2.5 minutes and poly- 
3 0 nucleotide synthesis by primer extension (elongation) at 
72 °C for 3.0 minutes followed by a return to the denatur- 
ation temperature. The resultant amplified V H -coding DNA 
homolog containing samples were then gel purified, 
extracted twice with phenol/ chlorof orm, once with chloro- 
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form followed by ethanol precipitation and were stored at 
"70°C in 10 mM Tris-HCl, pH 7.5, and 1 mM EDTA. 

To verify the amplification of the heavy chain, the 
PCR purified products were electrophoresed in an agarose 
5 gel. The expected size of the heavy chain was approxi- 
mately 730 base pairs as shown in Figure 23. The V H - 
coding double stranded DNA homologs were then used in 
subsequence PCR amplification reactions with V L -coding 
counterparts prepared below for the production of dicis- 
10 tronic DNA molecules having V H and V L cistronic portions as 
illustrated in Example 24. 

B - V L -Codina Doubl e Stranded DNA Homologs 

Cloned DNA, prepared in Example 22 from a combina- 
torial library that encodes human Fab fragments which bind 
15 tetanus toxoid (TT) , was used as a template for preparing 
a V L -coding double stranded DNA homolog. Human light 
chain, containing the entire coding region of kappa light 
chain (V L and C L ) , was amplified using the same PCR condi- 
tions described for human heavy chain with the exception 

2 0 that a human V L inside primer and C L f outside primer were 

used as the PCR primer pair (Table 11 and Figure 21) . The 
resultant V L -coding double stranded DNA homolog was gel 
purified and stored as described above. 

To verify the amplification of the light chain, the 
25 PCR purified products were electrophoresed in an agarose 
gel. The expected size of the light chain was approxi- 
mately 690 base pairs as shown in Figure 23. The VL- 
coding double stranded DNA homologs were then used in 
subsequent PCR amplification reactions with V H -coding 

3 0 counterparts prepared above for the production of 

dicistronic DNA molecules as illustrated in Example 24. 
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24. Preparation of Internallv-Prim ed Duplexes of V„- and 
V L -Codina DNA Homolog 

A. Hybridization of V, r with V L -Codinq DNA Homologs 

The v„- and V. -coding double stranded DNA homologs 
5 prepare in Examples 23A and 23B, respectively, were 
admixed together and denatured at 95 °C for 5 minutes to 
separate the strands of each homolog. The denatured V H - 
and V L -coding DNA strands in the admixture were then 
annealed at 54 *C for 5 minutes to form a V H - and V L -coding 

10 duplex DNA molecule hybridized at the 3 1 ends at region X 
of each original homolog. One strand of the X region 
(cistronic) bridge encodes at least one stop codon in the 
same reading frame as the upstream cistron, a ribosome 
binding site downstream from the stop codon, and a 

15 polypeptide leader (pelB) having a translation initiation 
codon in the same reading frame as the downstream cistron 
located downstream from the ribosome binding site. 

B. Primer Extension to Produce Dicistronic DNA Molecules 
The hybridized recombinant V H - and V L -coding DNA 

20 molecule (internally primed duplex) was subjected to 
primer extension and then amplified with only the V H and 
C L » primers following the PCR reaction procedure described 
in Example 23A. This second PCR reaction is schematically 
represented in Figure 21. The PCR reaction products were 

25 gel electrophoresed to verify the presence of the result- 
ant V H - and V L -coding dicistronic DNA molecules. The 
expected size of the dicistronic molecule was about 1390 
base pairs and is shown in Figure 23. The resultant V H - 
and V L -coding dicistronic DNA molecules were then ligated 

3 0 into the modified ImmunoZAP H vector (Figures 2 4A and 24B) 
for the construction of expression vectors as described in 
Example 30. 
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2 . Preparation of Mouse Hvbridoma V„- and V L -Coding 
Double Stranded DNA Homologs and Production of 
Dicistronic DNA Molecules in a Single Amplifi cation 
Reaction 

5 Mouse hybridoma heavy and light chain cDNA prepared 

in Example 22B was amplified in a single PCR reaction 
using the reaction conditions given above with an excess 
of the outside primers (200 nM concentration of both the 
mouse V H primer and C L » primer) and a limiting amount of 
10 the inside primers (20 nM concentration of both the mouse 
C H l f and V L primer) (Table 11). The resultant mouse heavy 
and light chain dicistronic molecules were then inserted 
into a modified ImmunoZAP H for construction of an 
expression vector as described in Example 30. 



15 26. Preparation of Internal lv-Primed Duplexes Using a 
Single Internal Primer that Overlaps Both the V u and 
V L Repertoires 

Another approach to producing a library of dicis- 
tronic DNA molecules is to use a single internal primer 

2 0 instead of using two separately internal primers. The 

process of creating a dicistronic molecule comprising an 
upstream V H cistron and a downstream V L cistron is to 
combine in a PCR buffer the following: a repertoire of V H 
genes consisting of at least 10 5 different genes; a reper- 
25 toire of V L genes consisting of at least 10 4 different 
genes; an outside V H primer; an outside V L ; and a poly- 
nucleotide strand having a 3' -terminal priming portion, a 
cistronic bridge coding portion, and a 5 1 terminal primer- 
template portion. The PCR reaction is performed as 

3 0 described in Example 22A. 

The 3 1 -terminal priming portion of a polynucleotide 
strand (linker) has a nucleotide base sequence homologous 
to a portion of the primer extension product of one of the 
outside primers. The 5 '-terminal priming portion encodes 
35 a nucleotide base sequence homologous to a portion of the 
primer extension product of the other outside primer. The 
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cistronic bridge coding portion encodes at least one stop 
codon in the same reading frame as the upstream cistron, 
a ribosome binding site downstream from the stop codon and 
a polypeptide leader (pelB) having a translation initia- 
5 tion codon in the same reading frame as the downstream 
cistron where the initiation codon is located downstream 
from the ribosome binding site. Polynucleotide strand 
(linker) primers useful in this invention are listed in 
Table 12. 



10 Table 12 

Polynucleotide Strand ( Linker) Primers 

Seq. 
Id. No. 

( 1 2 5 1 ) 1 1 1 5 1 GGAGAGTGGGTCATCACGAGCTCAGCCATGGCAGGTTGG 
15 GCAGCGATTAATAACAATCCAGCGGCTGCCGTAGGCAAT 

AGGTATTTCATTATGACTGTCTCCTTGCTATTAACTAGT 

ACAAGATTTGGGCTC 3" 
( 12 6 ) 2 2 1 5 1 GAGCCCAAATCTTGTACTAGTTAATAGCAAGGAGACAGT 

CATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 

2 o GTTATTAATCGCTGCCCAACCTGCCATGGCTGAGCTCGT 

GATGACCCACTCTCC 3 1 

1 Primes mRNA (sense strand) of heavy chain C H 1 
region; antisense strand of light chain V L with dicistronic 
bridge in between heavy and light chains will be in the 

25 same relative orientation as given in the example. 

2 Primes antisense strand of heavy chain C H 1 regions; 
and sense strand of light chain V L region with dicistronic 
in between heavy and light chains will be in the same 
relative orientation as given in the example. 

3 0 The resultant single step internally primed 

dicistronic DNA molecule can then be ligated into modified 
ImmunoZAP H for construction of an expression vector as 
described in Example 30. 
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27. Preparation of Lambda Zap II Expression Vector 

The vector Lambda Zap™ II (Stratagene) is a 
derivative of the original Lambda Zap (ATCC # 40,298) that 
maintains all of the characteristics of the original 
5 Lambda Zap including 6 unique cloning sites, fusion 
protein expression, and the ability to rapidly excise the 
insert in the form of a phagemid (Bluscript SK-) , but 
lacks the SAM 100 mutation, allowing growth on many Non- 
Sup F strains, including XLl-Blue. The Lambda Zap II was 

10 constructed as described in Short et al., Nucleic Acids 
Res. , 16:7583-7600, (1988), by replacing the Lambda S gene 
contained in a 4254 base pair (bp) DNA fragment produced 
by digesting Lambda Zap with the restriction enzyme Ncol. 
This 4254 bp DNA fragment was replaced with the 4254 bp 

15 DNA fragment containing the Lambda S gene isolated from 
Lambda gtlO (ATCC # 40,179) after digesting the vector 
with the restriction enzyme Ncol. The 4254 bp DNA frag- 
ment isolated from lambda gtlO was ligated into the 
original Lambda Zap vector using T4 DNA ligase and 

20 standard protocols for such procedures described in 
Current Protocols in Molecular Biology . Ausubel et al., 
eds., John Wiley and Sons, NY, 1987. 

28 . Preparation of V n -Expression Vectors, ImmunoZAP H and 
Modified ImmunoZAP H. Construction 

25 A. ImmunoZAP H 

The main criterion used in choosing a vector system 
was the necessity of generating the largest number of Fab 
fragments which could be screened directly. Bacteriophage 
lambda was selected as the expression vector for three 

3 0 reasons. First, in vitro packaging of phage DNA is the 
most efficient method of reintroducing DNA into host 
cells. Second, it is possible to detect protein expres- 
sion at the level of single phage plaques. Finally, the 
screening of phage libraries typically involve less dif f i- 

35 culty with nonspecific binding. The alternative, plasmid 
cloning vectors, are only advantageous in the analysis of 
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clones after they have been identified. This advantage is 
not lost in the present system because of the use of 
lambda Zap, thereby permitting a plasmid containing the 
heavy chain, light chain, or Fab expressing inserts to be 
5 excised. 

To express the plurality of V H -coding DNA homologs in 
an E. coli host cell, a vector was constructed that placed 
the V H -coding DNA homologs in the proper reading frame, 
provided a ribosome binding site as described by Shine et 

10 al., Nature, 254:34, (1975), provided a leader sequence 
directing the expressed protein to the periplasmic space, 
provided a polynucleotide sequence that coded for a known 
epitope (epitope tag) and also provided a polynucleotide 
that coded for a spacer protein between the V H -coding DNA 

15 homolog and the polynucleotide coding for the epitope tag. 
A synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by designing 
single stranded polynucleotide segments of 2 0-40 bases 
that would hybridize to each other and form the double 

2 0 stranded synthetic DNA sequence shown in Figure 25A. The 
individual single-stranded polynucleotides (N-,-N 12 ) are 
shown in Table 13 below. 

Table 13 
Seq. 
25 Id. No. 



(91) 


Nl) 


5' 


GGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3 1 


(92) 


N2) 


5 1 


AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3 • 


(93) 


N3) 


5 ' 


GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3 1 


(94) 


N4) 


5' 


AGGTGAAACTGCTCGAGAATTCTAGACTAGGTTAATAG 3 1 


30 (95) 


N5) 


5 ' 


TCGACTATTAACTAGTCTAGAATTCTCGAG 3 1 


(96) 


N6) 


5 1 


CAGTTTCACCTGGGCCATGG CTGGTTGGG 3 1 


(97) 


N7) 


5' 


CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3 ' 


(98) 


N8) 


5 1 


GTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGC 3 1 


(99) 


N9-4) 5 


• AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 3 


35 (100) 


Nil) 


5' 


GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3 1 


(101) 


N12) 


5' 


TCGACGAATTCTATTAAGAACCGTAGTC 3 1 
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(102) N10-5) 5" CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3' 
Polynucleotide N2, N3 , N9-4 1 , Nil, N10-5 1 , N6, N7 and 
N8 were kinased by adding 1 ill of each polynucleotide (0.1 
ug/nl) and 20 units of T 4 polynucleotide kinase to a solu- 
5 tion containing 70 mM Tris-HCl at pH 7.6, 10 mM MgCl 2 , 5 mM 
DTT, 10 mM beta mercaptoethanol , 500 ug/ml of BSA. The 
solution was maintained at 37 P C for 30 minutes and the 
reaction stopped by maintaining the solution at 65 °C for 
10 minutes. The two end polynucleotides, 20 ng, of poly- 

10 nucleotides Nl and polynucleotides N12, were added to the 
above kinasing reaction solution together with 1/10 volume 
of a solution containing 20 mM Tris-HCl, pH 7.4, 2 mM MgCl 2 
and 50 mM NaCl. This solution was heated to 70 °C for 5 
minutes and allowed to cool to room temperature, approxi- 

15 mately 25 °C, over 1.5 hours in a 500 ml beaker of water. 
During this time period all 10 polynucleotides annealed to 
form the double stranded synthetic DNA insert shown in 
Figure 25A. The individual polynucleotides were 

covalently linked to each other to stabilize the synthetic 

20 DNA insert by adding 40 pi of the above reaction to a 
solution containing 50 mM Tris-HCl, pH 7.5, 7 mM MgCl 2 , 
1 mM DTT, 1 mM ATP and 10 units of T4 DNA ligase. This 
solution was maintained at 37 °C for 3 0 minutes and then 
the T4 DNA ligase was inactivated by maintaining the 

25 solution at 65'C for 10 minutes. The end polynucleotides 
were kinased by mixing 52 £tl of the above reaction, 4 /xl 
of a solution containing 10 mM ATP and 5 units of T4 
polynucleotide kinase. This solution was maintained at 
37 °C for 3 0 minutes and then the T4 polynucleotide kinase 

30 was inactivated by maintaining the solution at 65 °C for 10 
minutes . 

The completed synthetic DNA insert was ligated 
directly into a lambda Zap II vector prepared in Example 
27 that had been previously digested with the restriction 
3 5 enzymes NotI and Xhol. The ligation mixture was packaged 
according to the manufacturer's instructions using 
Gigapack II Gold packing extract (Stratagene) . The pack- 
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aged ligation mixture was plated on XLl-blue cells 
(Stratagene) • Individual Lambda Zap II plaques were cored 
and the inserted excised according to the in vivo excision 
protocol provided by the manufacturer (Stratagene) . This 
5 in vivo excision protocol converts the cloned insert from 
the Lambda Zap II vector into a plasmid vector to allow 
easy manipulation and sequencing. The accuracy of the 
above cloning steps was confirmed by sequencing the insert 
using the Sanger dideoxy method described in by Sanger et 
10 al., Proc. Natl- Acad. Sci. USA , 74:5463-5467, (1977) and 
using the manufacturer's instructions in the AMV Reverse 
Transcriptase 35 S-ATP sequencing kit (Stratagene) . The 
sequence of the resulting V H expression vector is shown in 
Figure 25A and Figure 26. 



15 B. Modified ImmunoZAP H 

To create a fusion-PCR library from hybridoma RNA for 
expressing the plurality of V H -coding DNA homologs in an EL. 
coli host cell, a vector based on the ImmunoZAP H vector 
described above was constructed. The procedure for con- 

2 0 structing the vector was performed as described above with 

the following modifications: elimination of the SacI site 
between the T 3 polymerase and NotI sites and changing the 
nucleotide base residue sequence from AAA to CAG which 
resulted in an amino acid residue change from lysine to 
25 glutamine as shown in Figures 24A and 24B. 

The individual single-stranded polynucleotides (N 17 
N 4 , N 6 and N 7 ) , which were modified from their counterparts 
listed in Table 14, are listed in Table 14 below. 

Table 14 
30 Seq. 

Id. No. 

(127) Nl) 5 1 AGCTGCGGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3 1 

(128) N2) 5 1 AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3 1 

(129) N3 ) 5 1 GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3 ■ 

3 5 (13 0) N4) 5 1 AGGTGCAGCTGCTCGAGAATTCTAGACTAGGTTAATAG 3' 
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(131) N5) 5 f TCGACTATTAACTAGTCTAGAATTCTCGAG 3» 

(132) N6) 5« CAGCTGCACCTGGGCCATGGCTGGTTGGG 3' 

(133) N7) 5 1 CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3' 

(134) N8) 5 1 CTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGCGGCCGC 
5 3 1 

(135) N9-4) 5 1 AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 3' 

(136) Nil) 5 1 GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3 1 

(137) N12) 5' TCGACGAATTCTATTAAGAACCGTAGTC 3 1 

(138) N10-5) 5 f CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3' 
10 The modified ImmunoZAP H vector was created to 

eliminate an unnecessary SacI site in the ImmunoZAP H 
vector, (Example 28a, when the heavy and light chain 
vectors were combined. The modifications also improved 
the efficiency of secretion of positively changed amino 
15 acids in the amino terminus of the expressed protein. 
Inouye et al., Proc. Natl. Acad. Sci. USA . 85:7685-7689 
(1988) . 

29. Prepara tion of V L Expression Vector ImmunoZAP L 
Construction 

2 0 To express the plurality of V L coding polynucleotides 

in an E. coli host cell, a vector was constructed that 
placed the V L coding polynucleotide in the proper reading 
frame, provided a ribosome binding site as described by 
Shine et al., Nature, 254:34, (1975), provided a leader 
25 sequence directing the expressed protein to the peri- 
plasmic space and also provided a polynucleotide that 
coded for a spacer protein between the V L polynucleotide. 
A synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by designing 

3 0 single stranded polynucleotide segments of 20-40 bases 

that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 25B. The 
individual single-stranded polynucleotides (N-j-Ng) are 
shown in Table 13 above. 
35 Polynucleotides N2 , N3 , N4, N6 , N7 and N8 were 

kinased by adding 1 fil of each polynucleotide and 2 0 units 
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of T 4 polynucleotide kinase to a solution containing 70 mM 
Tris-HCl, pH 7.6, 10 mM MgCl 2 , 5 mM DDT, 10 mM 2ME , 500 
micrograms per ml of BSA. The solution was maintained at 
37 °c for 30 minutes and the reaction stopped by maintain- 
5 ing the solution at 65 °C for 10 minutes. The two end 
polynucleotides 20 ng of polynucleotides Nl and poly- 
nucleotides N5 were added to the above kinasing reaction 
solution together with 1/10 volume of a solution contain- 
ing 20 mM Tris-HCl, pH 7.4, 2 mM MgCl 2 and 50 mM NaCl. 

10 This solution was heated to 70 °C for 5 minutes and allowed 
to cool to room temperature, approximately 25°C, over 1.5 
hours in a 500 ml beaker of water. During this time 
period all of the polynucleotides annealed to form the 
double stranded synthetic DNA insert. The individual 

15 polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert with adding 40 ^1 of 
the above reaction to a solution containing 50 Tris- 
HCl, pH 7.5, 7 mM MgCl 2 1 mM DTT, 1 mM ATP and 10 units of 
T4 DNA ligase. This solution was maintained at 37 °C for 

20 3 0 minutes and then the T4 DNA ligase was inactivated by 
maintaining the solution at 65 P C for 10 minutes. The end 
polynucleotides were kinased by mixing 52 /xl of the above 
reaction, 4 fil of a solution recontaining 10 mM ATP and 5 
units of T4 polynucleotide kinase. This solution was 

25 maintained at 37 °C for 3 0 minutes and then the T4 poly- 
nucleotide kinase was inactivated by maintaining the 
solution at 65 °C for 10 minutes. 

The completed synthetic DNA insert was ligated 
directly into a Lambda Zap II vector prepared in Example 

3 0 27 that had been previously digested with the restriction 
enzymes NotI and Xhol. The ligation mixture was packaged 
according to the manufacturer's instructions using 
Gigapack II Gold packing extract and the packaged ligation 
mixture was plated on XLl-Blue cells as described in 

35 Example 28A. Individual lambda Zap II plaques were cored 
and the inserts excised according to the in vovo excision 
protocol as described in Example 28A. This in vivo 
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excision protocol converts the cloned insert from the 
Lambda Zap II vector into a phagemid vector to allow easy 
manipulation and sequencing and also produces the phagemid 
version of the V L expression vectors. The accuracy of the 
5 above cloning steps was confirmed by sequencing the insert 
using the Sanger dideoxy method described by Sanger et 
al., Proc. Natl. Acad, Sci. USA . 74:5463-5467, (1977) and 
using the manufacturer's instructions in the AMV reverse 
transcriptase 35 S-dATP sequencing kit (Stratagene) . The 

10 sequence of the resultin V L expression vector is shown in 
Figure 25B and Figure 27) . 

The V L expression vector used to construct the V L 
library was the phagemid produced to allow the DNA of the 
V L expression vector to be determined. The phagemid was 

15 produced, as detailed above, by the in vivo excision pro- 
cess from the Lambda Zap V L expression vector (Figure 27) . 

3 0 . Construction of V U L -Expression Vectors and Library 
A. Ligation of Dicistronic DNA Molecules with Modified 
ImmunoZAP H 

2 0 In preparation for cloning a library enriched in V H - 

V L -coding (V HL ) dicistronic DNA molecules, PGR amplified 
products (human or mouse) prepared in Examples 24, 25, and 
26 (50 mM NaCl, 25 mM Tris-HCl, pH 7.7, 10 mM MgCl 2 , 10 mM 
/3-mercaptoethanol, 100 ug/ml BSA, at 37 °C were digested 
25 with restriction enzymes Xhol and Xbal at a concentration 
of 60 units of enzyme per ug of DNA, and purified on a 1% 
agarose gel. After gel electrophoresis of the digested 
PCR amplified dicistronic DNA molecules, the region of the 
gel containing the DNA fragments of approximately 13 60 

3 0 base pairs in size was excised, purified using Gene-Clean 

(BIO 101, La Jolla, California) , ethanol precipitated and 
resuspended in 10 mM Tris-HCl, pH 7.5, and 1 mM EDTA to a 
final concentration of 10 ng/ul. Equimolar amounts of the 
insert were then ligated overnight at 4°C to 1 ug of 
35 modified ImmunoZAP H vector, prepared in Example 28b, 
(Stratagene) previously digested with Xhol and Xbal. A 
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portion of the ligation mixture (1 ul) was packaged for 2 
hours at room temperature using Gigapack Gold packaging 
extract (Stratagene) and the packaged material was plated 
on a permissive E. coli (strain XLl-blue) lawn to generate 
5 plaques. The library was determined to consist of predom- 
inantly V HL with less than 5% non-recombinant background. 

B. Screening of Antibody- P roducing Plaques 
(i) Human 

To screen for expression of V HL dicistronic molecules, 
10 E. coli were infected to yield approximately 100 plaques 
per plate. Replica filter lifts of the plaques on an agar 
plate were produced by overlaying a nitrocellulose filter 
that had been soaked in 10 mM isopropyl beta- 
dithiogalactopyranoside on each plate with transfer for 
15 15 hours at 23 °C. For detection of V HL antibody fragment 
expression, the filters were screened with rabbit anti- 
human heavy and light chain antibodies followed by goat 
anti-rabbit antibody coupled to alkaline phsophatase 
(Cappel Laboratories, Malver, Pennsylvania) . The detec- 

2 0 tion of immunoreactive product confirmed the presence and 

expression of V HL antibody fragments. 

To identify human DNA clones expressing antibody that 
bound TT, plaques were plated and proteins expressed as 
described above. Replica filters were incubated with 0.2 
25 nN 125 I-tetanus toxoid and washed. Positive plaques were 
identified by autoradiography and isolated. The frequency 
of positive clones in the library was equivalent to 
(number of positive clones) / [number of plaques screened) 
X (fraction of plaques expressing V HL ) . Concentrated non- 
30 adsorbed tetanus toxoid was iodinated with sodium iodide 
125 I (ICN, Irvine, California) by the Choramine-T method as 
described in Botton et al., Biochem. I, , 133:529-539 
(1973) and available in a kit (Iodo-Beads, Pierce, 
Rockford, Illinois) . 

3 5 Human DNA clones were re-plated at approximately 100 

phage per plaque side by side with the parental phage that 
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were used as templates for PCR amplification and screened 
in the primary antigen binding screen. The results of the 
screening procedure are seen in Figure 28. Similar 
signals between the parental clones and the V HL dicistronic 
5 DNA molecules demonstrated that the sequence differences 
introduced with the C H 1' and V L primers did not adversely 
affect gene expression. Also, it should be noted in 
Figure 28 that a random parental clone that did not react 
with tetanus toxoid, 7G1, was unreactive before and after 
10 the PCR dicistronic fusion, as was the control ImmunoZAP 
H vector (IZ H) . 

(ii) Mouse 

Mouse antibody-producing plaques prepared in Example 
27 were screened for antibody expression with rabbit anti- 
15 mouse heavy and light chain antibody (Cappel Laboratories) 
as described above. 

3 1 . Characterization of Cloned Dicistronic V 1 |L Repertoire 

in Expression Library 
A. Verification of Presence and Size of Cloned 

20 Dicistronic V |I L Repertoire 

Bacteriophage from purified reactive plaques prepared 
in Example 3 OB were converted to the plasmid format by in 
vivo excision with R408 helper phage according to manufac- 
turer's protocol (Stratagene) and also described in Short 

25 et al., Nucl. Acids Res. , 16:7583-7600 (1988). In the in 
vivo excision protocol, the cloned insert from the 
ImmunoZAP H vector was converted into a phagemid vector to 
allow easy manipulation and sequencing. Briefly, phage 
plaques were cored from the agar plates and transferred to 

3 0 sterile microfuge tubes containing 500 ul of a buffer 
containing 50 mM Tris-HCl, pH 7.5, 100 mM NaCl, 10 mM 
MgS0 A , and 0.01% (w/v) gelatin and 20 ul of chloroform. 

For excisions, 2 00 ul of the phage stock, 200 ul of 
XLl-Blue cells (A^ = 1.00) and 1 ul of R408 helper phage 

35 (1 x 10 10 plaque forming units Opfu)/ml) were incubated at 
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37 °C for 15 minutes. AFter a 4 hour incubation in Luria- 
Bertani (LB) broth and heating at 70" C for 2 0 minutes to 
heat kill the XLl-blue cells, the phagemids were re- 
infected into XLl-Blue cells and plated onto LB plates 
5 containing ampicillin. Double stranded DNA was prepared 
from the phagemid containing cells according to the meth- 
ods described by Holmes et al., Anal, Biochem. , 114: 193 , 
(1981) . Clones were first screened for DNA inserts by 
restriction digests with Xhol and Xbal. The detection of 
10 1390 base pair fragment on an agarose gel confirmed the 
presence of a V HL dicistronic molecule insert. 

B. Sequencing of Plasmids from Expression Library 

Clones containing the putative V HL insert were 
sequenced using reverse transcriptase according to the 

15 general method described by Sanger et al . , Proc. — Natl . 
Acad. Sci.. USA , 74:5463-5467, (1977) and the specific 
modifications of this method provided in the manufac- 
turer's instructions in the AMV reverse transcriptase 35 S- 
dATP sequencing kit (Stratagene) . 

2 0 Nucleotide sequence analysis of several fusion clones 

indicated that the sequence of the fusion region was 
identical to that shown in Figure 22, proving that the 
clones were actually generated through a fusion PCR 
intermediate . 

25 C. Advantages of Fusion-PCR to Produce Dicis tronic DNA 
Molecules 

PCR amplification can, therefore, be used to fuse 
sequences responsible for encoding subunits of a hetero- 
dimeric protein together into a single DNA fragment that 
30 can then direct the expression of both subunits from one 
expression vector. In the case of antibodies, if the 
source of nucleic acid template comes from hybridoma mRNA, 
there is only one heavy and light chain sequence to choose 
from, and thus the heavy: light pair is a "natural" pair. 
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However, if spleen, peripheral blood B-cell, or other 
lymphocyte mRNA is used as the source of template, the PCR 
fusion reaction to form a dicistronic DNA molecule can 
randomly pair heavy and light chains from different cells, 
5 producing a combinatorial library. In such a library, 
only a small fraction of the clones contain the original 
heavy and light chain pairs. This may not be a problem if 
the desired natural pair is well represented in the orig- 
inal B-cell population, as is the case with hyperimmunized 

10 donors. However, if one wishes to find a naturally 
occurring rare specificity in a combinatorial library, one 
may have to screen a large number of clones. 

The fusion method presented here may offer a solution 
to the random combinatorial problem. If one begins with 

15 a very dilute population of B-cells (possibly in a medium 
that limits diffusion) , it may be possible for the dicis- 
tronic event to occur between naturally paired heavy and 
light chain sequences before significant mixing between B- 
cell RNA occurs. Thus, the fused heavy and light chain 

20 sequences would be the original pairs, and the resulting 
library would express predominantly the naturally occur- 
ring antibody specificities. Such a library would be 
highly preferable when rare natural specificities are 
sought . 

25 Another advantage to this method is that only one 

vector and one cloning step are necessary. This saves a 
substantial amount of time, resources, and effort. 
Moreover, the ease of the single PCR reaction greatly 
simplified the process of going from B-cell RNA to an E^. 

3 0 coli library, making this approach a noteworthy alterna- 
tive to standard hybridoma technology. 

The foregoing is intended as illustrative of the 
present invention but not limiting. Numerous variations 
and modifications can be effected without departing from 

35 the true spirit and scope of the invention. 
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Claims ; 

1. A method of producing a nucleic acid vector 
encoding two or more desired genes, each from a family of 
genes, said genes being capable of together producing a 

5 characteristic that can be used to identify the vector 
encoding said desired genes from other vectors encoding 
other combinations of genes from said families of genes, 
which method comprises: 

a) randomly inserting into vectors one member from 

10 a first family of genes and one member from one or more 
other families of genes so that a population of vectors 
are created wherein each vector may contain one of the 
genes from said first gene family and one of the genes 
from each of said other gene families; 

15 b) identifying within said population of vectors a 

vector capable of detectably producing a desired charac- 
teristic resulting from the inclusion of one gene from 
said first gene family and one gene from each of said 
other gene families, and using said characteristic to 

2 0 distinguish the vector from other vectors within the 

population containing undesired combinations of gene 
members from said gene families. 

2. The method of claim 1 wherein said genes are 
inserted into a DNA vector at one or more integration 

25 sites, which method further comprises: 

a) preparing said vectors with one or more site- or 
region-specific recombination sequences; 

b) permitting, in the presence of one or more 
reagents facilitating said site- or region-specific 

3 0 recombination, a member of said first family of genes to 

combine in a vector with a member of said second family of 
genes . 

3 . The method of claim 2 wherein said site- or 
region-specific recombination site is recognized and acted 

35 on by flp recombinase. 
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4 . The method of claim 2 wherein said site- or 
region-specific recombination site is recognized and acted 
on by ere recombinase. 

5. The method of claim 2 wherein said site- or 
5 region-specific recombination site is recognized and acted 

on by lambda integrase recombinase. 

6. The method of claim 2 wherein at least one of 
the vectors contains a sequence capable of being 
recognized and acted on by transposase. 

10 7. The method in claim 1 where said genes are 

inserted into a DNA vector at one or more integration 
sites, which method further comprises: 

a) cleaving said vector with one or more site- 
specific integration reagents; 

15 b) preparing the ends of genes from said first 

family of genes so that one end will ligate with an end of 
the vector cleaved by a first reagent and the other with 
an end of the vector cleaved by a second reagent; 

c) preparing the ends of said genes from said other 

2 0 gene families so that one end will ligate with an end of 

the vector cleaved by a third reagent and the other with 
an end of the vector cleaved by a fourth reagent; 

d) preparing at least one double stranded DNA 
linker fragment having one end ligatable to one end of 

25 said genes from said first family of genes and the other 
end ligatable to one end of genes from said other family 
of genes; 

e) mixing said vector, genes, and said linker 
fragment or fragments together in a ligation mix and 

3 0 ligating the components. 

8. The method of claim 7 wherein said reagents are 
the same. 
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9. the method of claim 8 wherein said reagents are 
different. 

10. The method of claim 1, wherein said combination 
of genes is accomplished in vivo. 

5 11. A method of producing a host cell expressing two 

or more desired genes, each from a family of genes, said 
genes being capable of together producing a characteristic 
that can be used to identify the host cell expressing said 
desired genes from other host cells expressing other 
10 combinations of genes from said families of genes, which 
method comprises: 

a) randomly introducing into host cells one member 
from a first family of genes and one member from one or 
more other families of genes so that a population of host 

15 cells are created wherein each host cell may contain one 
of the genes from said first gene family and one of the 
genes from each of said other gene families; 

b) identifying within said population of host cells 
a host cell capable of detectably exhibiting a desired 

2 0 characteristic resulting from the inclusion of one gene 

from said first gene family and one gene from each of said 
other gene families, and using said characteristic to 
distinguish the host cell from other host cells within the 
population containing undesired combinations of gene 
25 members from said gene families. 

12. The method of claim 11 wherein said vectors are 
lambda bacteriophage vectors and the host cells are E. 
Coli. 

13 . A method of producing a nucleic acid vector 

3 0 encoding two or more genes belonging to families of genes, 

being capable of producing a characteristic that can be 
used to identify the vector encoding said genes from other 
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vectors encoding other members of the families of genes 
which method comprises: 

a) isolating a first population of vectors for 
which each member of said population may contain one 

5 member of a family of genes; 

b) inserting one member of a second family of genes 
into each of the vectors so that a population of vectors 
are created where each vector may contain one of the genes 
from said first family and one of the genes from said 

10 second family; 

c) identifying within said population of vectors a 
vector capable of producing a characteristic resulting 
from the inclusion of one gene from said first gene family 
and one gene from said second gene family, and using said 

15 characteristic to distinguish the vector from other 
vectors within the population containing other members of 
the gene families. 

14 . A method of producing a nucleic acid vector 
encoding two or more genes belonging to families of genes, 
20 said genes being capable of producing a characteristic 
that can be used to identify the vector encoding said 
genes from other vectors encoding other members of the 
families of genes, which method comprises: 

a) isolating a first population of vectors, for 
25 which each member of said population may contain one 

member of a first family of genes and a nucleic acid site 
or region at which the population of vectors can be 
combined with a second population of vectors; 

b) isolating a second population of vectors, for 
3 0 which each member of said population may contain one 

member of a second family of genes and a nucleic acid site 
or region at which the second population of vectors can be 
recombined with said first population of vectors so that 
one member of the first family of genes and one member of 
35 the second family of genes may be combined and expressed 
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in each member of a diverse population of recombined 
vectors ; 

c) recombining populations of said first and second 
vectors and at said nucleic acid site or region thereby 

5 creating a diverse population of recombinant vectors each 
of which may express one member of the first family of 
genes and one member of the second family of genes ; 

d) identifying within said population of 
recombinant vectors a vector capable of producing a 

10 characteristic resulting from the inclusion of one gene 
from each of said gene families. 

15. The method of claim 14 wherein said nucleic acid 
site is cleaved with site-specific reagent, which method 
further comprises: 

15 a) cleaving said first vector population with said 

reagent ; 

b) cleaving said second vector population with said 
reagent ; 

c) mixing both vector populations together in a 

2 0 ligation mix and ligating the two populations. 

16. The method of claim 14 wherein said nucleic acid 
region is a homologous region capable of undergoing homo- 
logous recombination, which method further comprises 
inserting one or more members of said first and second 

25 populations into a single host capable of carrying out 
homologous recombination and allowing such homologous 
recombination to occur. 

17. The method of claim 14 wherein said nucleic acid 
site is a target site for site-specific recombination, 

3 0 which method further comprises inserting one or more 

members of said vector populations into a single host 
capable of carrying out site-specific recombination at 
said nucleic acid site and allowing said site-specific 
recombination to occur. 
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18. The method of claim 17 wherein said target site 
for site-specific recombination is of the family of sites 
selected from flp, lox, and gamma-delta. 

19. The method of any of claims 1, 11, 13 or 14 
5 wherein said vectors are plasmid or cosmid vectors. 

20. The method of any of claims l f 11, 13 or 14 
wherein said vectors are phage vectors. 

21. The method of any of claims 1, 13 or 14 wherein 
said vectors are lambda bacteriophage vectors. 

10 22. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the interaction of sequence-specific 
nucleic acids with genes from said first and second 
families of genes. 

15 23. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the hybridization of nucleic acid 
probes with genes from said first and second of families 
of genes. 

20 24. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the expression of one or both of genes 
from said gene families as an RNA molecule. 

25. The method of claim 14 wherein the identifica- 
25 tion of a particular vector within the recombinant vector 
population involves the expression of one or both of genes 
from said gene families as an identifiable protein 
molecule. 
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26. The method of claim 25 wherein the protein 
molecule (s) contains a binding site for another molecule. 

27. The method of claim 26 wherein the protein 
molecule (s) contains an epitope recognized by an antibody. 

5 28. The method of claim 27 wherein the protein 

molecule (s) contains an immune molecule binding site for 
an epitope. 

29. The method of claims 14 wherein both genes 
express an RNA and/or polypeptide and said RNAs and/or 

10 polypeptides physically interact within a host to create 
said characteristic. 

30. The method of claim 29 wherein both genes 
express polypeptides that physically interact to form a 
neo-epitope recognized by an immune molecule. 

15 31. The method of claim 2 9 wherein both genes 

express polypeptides that physically interact to form a 
binding site for another molecule. 

32. The method of claim 31 wherein the polypeptides 
are derived from antibody genes such that the interaction 

2 0 of both polypeptides forms an antigen binding site. 

33. The method of any of claims 1, 11 , 13 or 14 
wherein the vectors contain a single promoter that 
expresses the genes from said gene families. 

34. The method of any of claims 1, 11, 13 or 14 
25 wherein said genes from said gene families are each 

expressed from their own promoter. 

35. The method of claim 11 wherein the host is a 
mammalian cell. 
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36. The method of claim 11 wherein the host is a 
eukaryotic cell. 

37. The method of claim 11 wherein the host is a 
prokaryotic cell. 

5 38. The method of any of claims 1, 11, 13 or 14 

wherein there are more than two gene families and the 
vectors produced contain a random assortment of one member 
of each gene family needed to create said characteristic. 

39. HCFLP. 

10 40. LCFLP. 

41. A method of producing a biological agent having 
a desired phenotype wherein said phenotype results from 
expression of a particular combined nucleotide sequence 
and wherein said phenotype can be used to identify the 

15 biological agent having the particular combined nucleotide 
sequence which comprises: 

(a) bringing together a first population of 
nucleotide sequences with one or more other populations of 
nucleotide sequences to produce combined nucleotide 

2 0 sequences wherein each separate combined nucleotide 

sequence comprises one member of each population of 
nucleotide sequences ; 

(b) expressing said combined nucleotide sequences in 
biological agents; and 

25 (c) identifying those biological agents which 

express said desired phenotype. 

42. A method according to claim 41 wherein said 
phenotype can be used to distinguish the biological agent 
from bioological agents having other combined nucleotide 

3 0 sequences further comprising using said phenotype to 

distinguish those biological agents expressing the 
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particular combined nucleotide sequence from biological 
agents having other combined nucleotide sequences. 

43. A method according to claim 41 wherein said 
biological agent is a cell. 

5 44. A method according to claim 41 wherein said 

biological agent is nucleic acid vector. 

45. A method according to claim 41 wherein said 
biological agent is a bacteriophage or virus. 

46. A method according to claim 41 wherein said 
10 phenotype results from expression of a hybrid polypeptide 

which is encoded by the particular combined nucleotide 
sequence and is encoded at least in part by one nucleotide 
sequence from each population of nucleotide sequences 
which was brought together. 

15 47. A method according to claim 41 wherein said 

phenotype results from expression of a plurality of 
polypeptides wherein a polypeptide is encoded at least in 
part by one nucleotide sequence from each separate 
population of nucelotide sequences which was brought 

2 0 together . 

48. A method according to claim 41 wherein two 
populations of nucleotide sequences are combined. 

49. A method according to claim 47 wherein said 
phenotype results from expression of a heterodimeric 

25 polypeptide wherein one subunit of said dimer is encoded 
at least in part by the nucleotide sequence from the first 
population of nucleotide sequences and the other subunit 
of said dimer is encoded at least in part by the nucleic 
sequence from the second population of nucleotide 

3 0 sequences. 
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50. A method according to claim 48 wherein said 
phenotype results from expression of a first polypeptide 
encoded at least in part by the nucleotide sequence from 
the first population of nucleotide sequences and of a 

5 second polypeptide encoded at least in part by the 
nucleotide sequence from the second population of 
nucleotide sequences. 

51. A method according to claim 48 wherein said 
phenotype results from expression of an RNA molecule 

10 encoded at least in part by the nucleotide sequence from 
the first population of nucleotide sequences and a second 
RNA molecule encoded at least in part by the nucleotide 
sequence from the second population of nucleotide 
sequences . 

15 52. A method according to claim 48 wherein said 

phenotype results from synthesis of an RNA molecule 
encoded at least in part by the nucleotide sequence from 
the first population of nucleic acid sequences and by the 
nucleic acid sequence from the second population of 

2 0 nucleic acid sequences. 

53. A method according to claim 48 wherein the first 
and second populations of nucleotide sequences are 
combined by co-infection or co-transformation of host 
cells. 

25 54. A method according to claim 48 wherein members 

from said first and second populations of nucleotide 
sequences are combined randomly to give combined 
nucleotide sequences. 

55. A method according to claim 41 wherein the 

3 0 combining of said populations of nucleotide sequences 

gives a combined nucleotide sequence which was not 
previously expressed in said biological agent. 
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56. A method according to claim 41 wherein said 
desired phenotype comprises a phenotype which was not 
previously expressed in a population of such biological 
agents . 

5 57. A method according to claim 41 wherein said 

first population of nucleotide sequences comprises non- 
identical nucleotide sequences. 

58. A method according to claim 41 wherein each 
population of nucleotide sequences comprises non-identical 

10 nucleotide sequences. 

59. A method of producing a nucleic acid vector 
encoding a preselected combined nucleotide sequence which 
comprises two or more preselected nucleotide sequences, 
each independently selected from a population of nucleo- 

15 tide sequences, said combined nucleotide sequence being 
capable of producing a characteristic that can be used to 
identify the vector encoding said preselected combined 
nucleic sequence comprises 

(a) bringing together a member nucleotide sequence 
2 0 from each population of nucleotide sequences to give a 

population of combined nucleotide sequences wherein each 
combined nucleotide sequence comprises a nucleotide 
sequence from each population; 

(b) inserting into vector a member of the population 

2 5 combined nucleotide sequences so that a population of 

vectors is created wherein each vector may contain a 
combined nucleic acid sequence; 

(c) identifying within said population of vectors, 
a vector capable of detectably producing a desired 

3 0 characteristic resulting from inclusion of the preselected 

combined nucleic acid sequence. 

60. A method according to claim 59 wherein said 
characteristic can be used to distinguish the vector 
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encoding the preselected combined nucleotide sequence from 
other vectors encoding other combinations of nucleotide 
sequences further comprising using said characteristic to 
distinguish the vector from other different vectors within 
5 the population having unselected combined nucleotide 
sequences . 

61. A method according to claim 60 wherein said 
nucleotide sequences are combined randomly. 

62. A method according to claim 61 wherein said 
10 combined nucleotide sequences are produced using fusion 

polynucleotide amplification. 

63. A method according to claim 59 wherein said 
combined nucleotide sequences are produced using fusion 
polynucleotide amplification. 

15 64. A method according to claim 1 wherein a 

dicistronic or multicistronic DNA sequence which comprises 
one member from the first family of genes and one member 
from one or more than families of genes which comprises a 
random combination of said members of said families of 

20 genes is synthesized using fusion polynucleotide 
amplification and inserted into vectors. 

65. A method for producing a biological agent having 
a desired novel phenotype wherein said phenotype results 
from expression of a particular combined nucleotide 

25 sequence and wherein said phenotype can be used to 
identify the biological agent having x the particular 
combined nucleotide sequence; which comprises: 

(a) replicating at least portions of at least two 
parent nucleotide sequences under conditions that allow 

3 0 mutations to occur in either nucleotide sequence to 
generate a population of diverse replicas of each parent 
nucleotide sequence; 
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(b) randomly bringing together the populations of 
diverse replicas to produce combined nucleotide sequences 
wherein each combined nucleotide sequence comprises one 
member of each population of diverse replicas; 
5 (c) expressing said combined nucleotide sequences in 

biological agents ; and 

(d) identifying those biological agents which 
express said desired phenotype. 

66. A method according to claim 65 wherein said 
10 desired phenotype is distinguishable from phenotypes 

expressed by said parent nucleotide sequences. 

67. A method according to claim 66 wherein said 
phenotype can be used to distinguish it from biological 
agents having other combined nucleotide sequences using 

15 said phenotype to distinguish those biological agents 
expressing the particular combined nucleotide sequence 
from biological agents having other combined nucleotide 
sequences. 

68. A method according to claim 65 wherein the 
2 0 parent nucleotide sequences comprise a single DNA molecule 

and are replicated together; further comprising separating 
the populations of diverse replicas of each parent 
nucleotide sequence prior to bringing together step (b) . 

69. A method according to claim 68 which comprises 
25 replicating two parent nucleotide sequences. 

70. A method according to claim 65 wherein the 
parent nucleotide sequences are separately replicated. 

71. A method according to claim 7 0 which comprises 
replicating two parent nucleotide sequences. 
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72. A method according to 71 wherein a first parent 
nucleotide sequence is replicated in one population of 
cells and a second parent nucleotide sequence is repli- 
cated in a second population of cells and said cell 

5 populations are mixed and fused to generate cells which 
express combined nucleotide sequences. 

73. A method according to claim 72 wherein said 
first parent nucleotide sequences codes for a selected V L 
and said second parent nucleotide sequences codes for a 

10 selected V H , said cells are E. coli; and said combined 
nucleotide sequences express a Fab. 

74 . A method for producing a biological agent having 
a desired phenotype wherein said phenotype results from 
expression of a particular combined nucleotide sequence 

15 and wherein said phenotype can be used to identify the 
biological agent having the particular combined nucleotide 
sequence which comprises: 

(a) replicating parent populations of nucleic acid 
sequences to generate a population of diverse replicas of 

2 0 each parent population: 

(b) randomly bringing together the populations of 
diverse replicas to produce combined nucleotide sequences 
wherein each combined nucleotide sequence comprises one 
member of each population of diverse replicas; 

25 (c) expressing said combined nucleotide sequences in 

biological agents; and 

(d) identifying those biological agents which 
express said desired phenotype. 

75. A method according to claim 74 wherein said 

3 0 desired phenotype is distinguishable from phenotypes 

expressed by said parent populations of nucleotide 
sequences . 
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76. A method according to claim 75 wherein said 
phenotype can be used to distinguish said biological agent 
from biological agents having other combined nucleotide 
sequences, further comprising using said phenotype to 
5 distinguish those biological agents expressing the 
particular combined nucleotide sequence from biological 
agents having other combined nucleotide sequences. 
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FIG. 5-1 

Subclass I (A) 
#L39 

CTCGAGTCAGGACCTGGCCTCGTGAAACCTTCTCAGTCTCTGTCTCTC 
arr^nr^rTCTnArT^GCTACTCgATCACC AGTGCTTATTACTGGAAC 

TGGATCCGGCAGTT 



Subclass II (A) 
#L11 

CTCGAGTCTGGGCCTnAACTGGCAAAACCTGGGGCCTCAGTGAAGATG 
TCCTG CAAGG CTT CTGGC CACAC CTTGAC TAGTTACTGGATACACT GG 
GTAAAAnAGAGGCC 

#L03 

CTCGAGTCTGGACCTnAGCTGGTAAAGCCTGGGGTTCAGTGAAGATGT 
COTGCAAGGCTTOTGGATACACATTCAC^ lAGCTATGTTATACACT GGG 
TGAAGCAGAAGCCT 

FIG. 5-2 

#L3 2 

CTCGAGTCTGGACCTGAACTGGTAAAGCCTGGGACTTCAGTGAAGATG 
TCCTGCAAGGCTTCTGGATACACATTCACC AGCTATGTTATGCGCT GG 
GTGAAGCAGAAGCC 

Subclass II (B) 
#L3 7 

CTCGAGTCAGGGGCTGAACTGGTGAAGCCTGGGGTTTCAGTGAAGTTG 
TCCTGCAAGGCTTCTGGCTACACCTTCACn AGCTACTATATGTACT GG 
GTGAAGCAGAGGCC 



#L06 

CTCGAGTCTGGGGCTAAGCTGGTAAGGCCTGGAGCTTnAGTnAAGCTG 
TCCTGnAGGGCTTCTGGCTACTCCTTCAC nAGCTACTGGATGAACT GG 
GTGAAGCAGAGGCC 
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FIG. 5-3 

Subclass II (C) 
#L33 

CTCGAGTCTGGGGCTGAGCTGGTGAGGCCTGGAGCTTCAGTnAAGCTG 
TCCTGCAAGGCCTCTCGTACTCCTTCACCAG^TCCTGATAACTGGGTG 

AAGCAGAGGCCTGG 

Subclass III (B) 
#L3 6 

CTCGAGTCAGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTGAAACTC 
TCCTGTGCAGCCTCAGGATTCGATTTnAGllAGATACT^GjG^TG^ATTGG 

GTCCGGCAGCTCCA 



#L02 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTGAATCTC 
CCCTCTGrA r;^^^ A ^^ a rprpo ri a'T^TnRr:riAf:ATAATGGATGAGTTGG 

GTTCGGCAGGCTCC 

FIG. 5-4 

#L31 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTGAAAGTC 
^r^CTr.rAarrrrar,f;aTTf;r;ATTTTiAG nAGATACTGGATGAGTT GG 
GTCCGGCAGCTCCA 

#L34 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTCAAACTC 
^o^CTGrAGrrTgAGGATTCGATTTrLAG nAGATACTGGATGAGTT GG 
GTCCGGCAGCTCCA 

#L50 

CTCGAGTCAGGAGGTGGCCTGGTGCAGCCTGGAGGAGCCCTGAAACTC 
^nr^CTrinA^rOTCAGGATTCGATTTnAG nAGATACTGGATGAGTT GG 
GTCCGCAGCTCCAG 
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FIG. 5-5 

Subclass III (C) 
#L10 

CTCGAGTCTGGGGGAGGCTTAGTnCAGCCTGGAGGGTCCCGGAAACTC 

TCCTGTGC^ 
ATTCGTCAGGCTCC 

#L08 

CTCGAGTCTGGGGGAGGCTTAGTnnAGCCTGGAGGGTCCCGGAAACTC 

TCCTGTGCAGCCZTCTGGATTCACTTTnAG 

GTTACGTCAGGCTC 



Subclass V (A) 
#1,3 8 

CTCGAGTCAGGGGCTGAACTGGTGAGGCCTGGGCGTTCAGTnAAGATG 

TCCTGCAAGGC^C^GGCTATTCCTTCACC 

GTGAAACAGAGGCC 

FIG. 5-6 

Miscellaneous 
#47 

CTCGAGTCAGGGGCTGAACTGGCAAAACCTGGGGCCTCAGTAAAGATG 
TCCTGCAAGGCTTCTGGCTACACCTCTTCTTCCTTCTGGCTGCACTGG 

ATAAAAGAAGGCCT 
#L3 5 

CT CG AGT CTGG AC CTnAGCTGGTGAAGCCTGGGGTTCAGTTAAAATAT 

CCTGCAAGGCTTCTGGTTACTCATTTTCTnTCTACrr^ 

TGATGCAGAGCCAT 



#L48 

CTCGAGTCAGGGGCTGAACTGGTGAAGCCTGGGGTTCAGTAAGTTGTC 
CTGAAGGCTTCTGGCTACACCTTCACCGGCTACTATATGTACTGGGTG 

AAGCAGAGGCCTGG 
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FIG. 6A-1 

V u TTYPRESSJ nN VICTOR: 

SHINE-DALGARNO MET 



GGCCGCAAATTCTATTTCAAGGAGACAGTCATAATG 
CGTTTAAGATAAAGTTCCTCTGTCAGTATTAC 



leader sequence 

'aaatacctattgcctacggcagccgct 
tttatggataacggatgccgtcggcga 



LEADER SEQUENCE 

ggattgttattactcgctgcccaaccag' 
cctaacaataatgagcgacgggttggtc 



FIG. 6A-2 

LINKER LINKER 



NCOI 



V H BACKBONE! XHOI SPEI 



EcATGGCCC'AGGTGAAACTdbTCGAGATTTCTAGACTAGT 
GGTACCGGGTCCACTTTGACGAGCTCTAAAGATCTGATCA 

' STOP LINKER 

TyrProTyrAspValProAspTyrAlaSer , 

TACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTCG 
ATGGGCATGCTGCAAGG CCTGATG CCAAGAATTATCTTAAG CAG CT 
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FIG. 6B-1 



V. EXPRESSION VECTOR: 



SHINE— DALGARNO 

GGCCGCAAATTCTATTTCAAGGAGACAGTCATA 1 
CGTTTAAGATAAAGTT C CT CTGTCAGTAT 



MET LEADER SEQUENCE 

ATGAAATACCTATTGCCTACGGCAGCCGCTGGA 
TACTTTATGGATAACGGATGCCGTCGGCGACCT 



LEADER SEQUENCE 

TTGTTATTACTCGCTGCCCAACCAG' 
AACAATAATGAGCGACGGGTTGGTC 



FIG. 6B-2 

LINKER 

™~ V H BACKBONE 



NCOI 



CCATGGCCCAGGTGAAACTG 
GGTACCGGGTCCACTTTGAC 



LINKER 



XHOI SPEI 


STOP 


'CTCGAGAATTCTAGACTAG'] 


t'TAATAG' 



GAGCTCTTAAGATCTGATCAATTATCAGCT 
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•5' 



AAAA-3' 



5" 1 

BEGIN PCR 
5'. V 



AAAA-3" 
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| PCR CYCLES 
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HI 
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3' 
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• DNA SYNTHESIS AND CONTINUE PCR 
J WITHV H AND C j_ PRIMERS 
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HEAVY CHAIN 
HOMOLOGY 



Spe I I Stops ribl 



pe I B LEADER 
M K Y L L P T 

5 ' -GCCTACGG 

3' 1CTCGGGTTTAGA"ACA|TGATCA ATTATC GTTCCTCT6TCAGTATTACTTTATGGATAACGGAT6CC 

TREAM PRIMER &\\\') 



HEAVY CHAIN DOWNS 



LIGHT CHAIN 
HOMOLOGY 



AAAGLFF I A A Q P A M A Sac 

r^rrqpTgp.tTTP.TTATTAATP.cr.TCCCP.AACCTCCCATGGCTGAGCTC |GT6ATGACCCACTCTCC- 

GTCGGCG ACCTAACAATAA- 5 



3' 



LIGHT CHAIN UPSTREAM PRIMER (V L ) 



FIG. 22. 
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MODIFIED V H EXPRESSION VECTOR: 



Not I RIBOSOME BINDING SITE 

5' GAGCTGCGGCCGCAAATTCTATTTCAA66A6ACA6TCATA 
3' CGCCGGCGTTTAAGATTAAAGTTCCTCTGTCAGTAT 



Pel B LEADER 

MetLysTyrLeuLeuProThrAlaAloAloGlylfiuLeuLeuLeuAla 

ATGAAATACCTATTGCCTACGGCAGCCGCTCGATTGnATTACTCGCT 

TACTTTATGGATAACGGATGCCGTCGGCGACCTAACAATAATGAGCGA 

FIG. 24a. 

Ncol Xhol Xbal Spel 

AloGlnProAlaMetAlaGlnVolGlnLeuLeuGkj Thr 
GCCCAACCAGCCATCGCCCAGGTGCAGCTGCTCGAGATTTCTAGACT 
CGGGTTGGTCGGTACCGGGTCCACGTCGACGAGCTCTAAAGATCTGA 



EcoRI 

SerTyrProTyrA^)VblProAspTyrGlySerStop 

AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTCG 

TCAATGGGCATGCTGCAAGGCCTGATGCCAAGAATTATCTTAAGCAGCT 
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V H EXPRESSION VECTOR: 



Not I RI80S0ME BINDING SITE 

5" GGCCGCAAATTCTATTTCAAGGAGACAGTCATA 
CGTTTAAGATAAAGTTCCTCTS1CA6TAT 



Pel B LEADER 

MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeuLeuLeuLeuAlo 

ATGAAATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCT 

TACTTTATGGATAACGGATGCCGTCGGCGACCTAACAATAATGAGCGA 

Ncol Xhol XbalSpel F~IG. 25C. 

\ BACKBONE 

AlaGinPrvAlaMetAlaGlnValLysLeuLeuGlu' Thr 
GCCCAACCAGCCATGGCCCAGGTGAAACTGCTCGAGTTTCTAGACT 
CGGGTTGGTCGGTACCGGGTCCACTTTGACGAGCTCTAAAGATCTGA 



EcoRI 

SerTyrProTyrAspVolProAspTyrGlySerStop 

AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTCG 

TCAATGGGCATGCTGCAAGGCCTGATGCCAAGAATTATCTTAAGCAGCT 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



34/37 



V. EXPRESSION VECTOR: 



EcoRI RIBOSOME BINDING SITE 

5' TGAATTCTAAACTAGTCGCCAA6GAGACAGTCATA 
3' TCGAACTTAAGATTTGATCAGCGGTTCCTCTGTCAGTAT 



Pel B LEADER 

MetLysTyrLeuLeuProThrAlaAlaAloGlyLeuLeuLeuLeu 
ATGAAATACCTAnGCCTACGGCAGCCGCTGGAnGnATTACTC 

TACTTTATCGATAACGGATGCCGTCGG(»ACCTAACAATAATCAG _ 

FIG. 25b. 

Ncol Sad 

AlaAlaGlnProAlaMetAloGluLeu 

GCTGCCCAACCAGCCATGGCCGAGCTC 

CGACGGGTTGGTCGGTACCGGCTCGAG 



Xbal 
Stop Stop 

GTCAGTTCTAGAGTTAAGCGGCCG 
CAGTCAAGATCTCAATTGGCCGGCAGCT 
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